2203.11906v1 [astro-ph.CO] 22 Mar 2022 


. 
. 


arXiv 


X-ray cluster cosmology 


Nicolas Clerc* and Alexis Finoguenov‘ 


Abstract Formation of dark matter halos is sensitive to the expansion rate of the 
Universe and to the growth of structures under gravitational collapse. Virialization 
of halos heats the gaseous intra-cluster medium to high temperatures, leading to 
copious emission of photons at X-ray wavelengths. We summarize the progress of 
X-ray surveys in determining cosmology using galaxy clusters. We review recent 
cosmological results based on cluster volume abundance, clustering, standard can- 
dles, extreme object statistics, and present relevant theoretical considerations. We 
discuss clusters as gravitation theory probes and present an outlook on future devel- 
opments. 
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1 Introduction: Role of X-rays in cluster cosmology 


X-ray emission from clusters is predominantly of thermal origin, a convincing ob- 
servational proof being the presence of atomic emission lines on top of bremsstrahlung 
continuum in X-ray spectra. The X-ray emitting gas, generically identified to the 
intra-cluster medium (ICM), fills the entire volume of massive, Mpc-sized clusters, 
within which hundreds to thousands of galaxies orbit under the influence of the grav- 
itational force. When gas and galaxies are in the same gravitational potential, and 
galaxies are in virial equilibrium within the cluster potential, assuming gas particles 


have roughly similar three-dimensional velocity dispersion as galaxies?: 


kaT / ump ~ 02 ~ GM /3R (1) 


where Miot and R are typical mass and radius for a cluster, kg the Boltzmann con- 
stant, mp a proton mass and G the Universal constant of gravitation. Considering 
galaxy radial velocity dispersion o, ~ 1000kms~! and mean molecular weight 
u ~ 0.6 provides a temperature T ~ 7 x 107 K, equivalently kgT ~ 6 keV. The gas 
with electron density n, ~ 107? cm7? is heavily ionized at this temperature. Ther- 
mal bremsstrahlung is the main X-ray emission mechanism in hot clusters, whose 
emissivity scales with the product of npne, with the proton density np ~ ne/1.2. 
Also, the plasma is optically thin to the emitted photons*, so the black body spec- 
trum does not form. The number of emitted photons is much lower than the number 
of baryons, and the emission is called diffuse, similar to supernova remnants, but 
different from compact sources where photon ionization is important. Outside of 
the cluster virial radius, where the density drops, in the presence of background UV 
radiation field, the ionization processes start to play a role [114]. At temperatures 
below 1 keV processes driving line emission become relatively more important and 
most of them also scale with npne (e.g. bound-bound transitions due to collisional 
excitation). 

The total X-ray luminosity of the gas scales with the square of density times the 
volume V; while the total gas mass scales with n,V. Based on these arguments we 
expect total mass to correlate strongly with gas temperature, gas mass and X-ray lu- 
minosity, all of them are retrievable from the X-ray spectra, along with metallicity, 
which controls the contribution of the line emission to the total X-ray luminosity, 
described above. The importance of X-rays for cosmology can be understood from 
this scaling. Massive virialized halos are easily identified as luminous X-ray ex- 
tended sources and their masses are readily available from X-ray measurements, 
once the source distance is known. The distance is obtained from a measurement of 
the cluster redshift. Redshift is most commonly inferred from optical observations 
of cluster galaxy members?. Furthermore, line-of-sight projection effects are low 
thanks to the emissivity scaling with n?. 


3 Numerical simulations show that galaxies move slightly supersonically through the ICM [69]. 
^ Notable exception from this picture is driven by resonant absorption lines in cluster cores [83]. 
> Resolved emission lines in X-ray spectra may also provide a redshift measurement [263, 29]. 
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Detection of the cluster emission in X-ray requires space instrumentation due 
to opacity of the Earth atmosphere, and strongly depends on the energy band: ex- 
tremely soft X-rays with energies < 0.3 keV are strongly absorbed by the Milky 
Way halo and interstellar medium, and local foregrounds are high, so it is a rela- 
tively poorly understood regime, but important for low mass halos. The standard 
band for cluster detection is 0.5-2.0 keV, employed by ROSAT, Chandra, XMM- 
Newton, SRG/eROSITA studies. Slight variations of the choice of the band in order 
to increase the sensitivity of the survey are often made, as current X-ray instru- 
mentation records? the energy of the detected photon and bands are not pre-set, 
unlike in the optical/IR photometry. At energies above 2-2.5 keV, the reflectivity 
of the mirrors drops (and off-axis reflection efficiency drops even more) with the 
exact behaviour dependent on the material (with leading performance of the gold 
coating), which reduces the collecting area of the X-ray telescope by an order of 
magnitude [267, and Section “Optics for X-ray Astrophysics” in this handbook]. In 
addition, the emission of high-redshift clusters as well as galaxy groups is soft, due 
respectively to redshifting of the spectrum and to lower gas temperature resulting in 
thermal cut-off of the emission. 

In order to be a competitive cosmological probe, detection of clusters has to be 
reproducible, robust, and sensitive to cosmology. Sensitivity to cosmology is due to 
scaling relations between X-ray properties and cluster total mass (for more details 
on this read Chapter “Scaling relations of clusters and groups, and their evolution"). 
The actual observing tests use X-ray luminosity function of clusters, temperature 
function, as well as combination of gas mass and temperature (aka Yy). The next 
sections provide multiple references to these works. 

Collecting area of X-ray telescopes has been the largest challenge in the fea- 
sibility of cluster detection, due to inefficiency of X-ray reflection, requiring inci- 
dence angles close to zero, which correspondingly zeros out the mirror collecting 
area. The collecting area of current instrumentation is ~ 1000 cm?, while in the 
future a gain by an order of magnitude is expected from Athena. The sky back- 
ground in the 0.5-2 keV is 10~! ergs s~! cm~? arcmin ?, and the all-sky sensitivity 
achieved by eROSITA is 10~'* ergs s! cm~? arcmin ?. Deeper fields observations, 
performed by Chandra and XMM-Newton reach 10716 ergs s^! cm~? arcmin ?. The 
cluster size to cluster flux relation, as well as a confusion limit, implies that instru- 
ment resolution has to be better than arcminute at depths (in units ergs s~! cm?) 
below 10-2, < 30" below 10713, < 10" below 10714, < 5" below 107, reaching 
1 " at 1077 ergs s^! cm? arcmin 7. So, low resolution telescopes can only effec- 
tively perform shallow surveys, while for high-resolution telescopes, like Chandra, 
only Msec exposures open up unique advantages. The number of clusters per square 
degree scales from 0.3 for ROSAT All-Sky Survey (RASS) [76] to 3 for eROSITA 
[145], 15 for wide XMM-Newton/Chandra surveys [188], 100-1000 for deep and 
ultra-deep XMM surveys [74, 78]. 

For comparison, red sequence cluster search yields 5—10 clusters per square de- 
gree, depending on the depths of optical/NIR imaging [210, 211]. Spectroscopic 


$ Albeit with low resolution - improvements involve the use of calorimeters, e.g. X-IFU [13]. 
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catalog of galaxy groups yield 10-1000 systems per square degree [81, 121, 237], 
but large area surveys are still limited to z < 0.1. The main bottleneck of opti- 
cal surveys are the projection effects, which limits the mass range accessible to 
red sequence studies and overmerging/oversplitting problem affecting spectroscopic 
galaxy groups as well as developing an estimator of group mass, which in most al- 
gorithms comes with large scatter [176, 177]. The strength of X-rays is to improve 
on the purity and completeness with respect to optical catalogs. As X-ray emission 
traces the virialization of gas, it characterizes the central part of the group, typi- 
cally extended to the scale radius’ Rsoo, while efficient galaxy group search using 
spectroscopic surveys requires one to consider much larger radii. As a result, not all 
optical groups are supposed to be detected at X-rays, but only those with masses 
within R59o above X-ray sensitivity limits. The open issue of effect of Active Galac- 
tic Nuclei (AGN) feedback on the intergalactic medium (IGM) further reduces our 
ability to predict X-ray properties of optical groups. Nevertheless, the cosmology 
using X-ray galaxy groups yields consistent results to clusters, which indicates that 
the prospects of understanding galaxy groups through X-rays are bright. 


2 Role of massive halos in cosmology 


Numerical simulations are generally required for a complete and detailed theoretical 
modelling of the formation of dark matter halos. However a ‘first order’ treatment 
highlights many of the features relevant to cluster cosmology, such as the exponen- 
tial decrease in the abundance of high-mass objects and the roles of matter-energy 
components of the cosmological model on the distribution of halos. This section 
provides a few elements of the theory of structure formation. By no means it is a 
comprehensive overview of the topic, which may be found in suitable cosmology 
textbooks. We take this opportunity to introduce some key parameters and notations 
frequently referred to in this Chapter. 


2.1 The homogeneous model 


The homogeneous, isotropic background as described by the Friedmann-Lemaitre- 
Robertson-Walker metric involves the dimensionless scale factor a(t) = (1 +z)71, 
with z representing the redshift and t a universal cosmological time coordinate. Its 
evolution is governed by Einstein field equations linking curvature in the metric to 
the energy-momentum tensor, involving in particular the fundamental constants G 
(Newton gravitation constant) and c (speed of light in the vacuum). This tensor en- 
capsulates contributions from non-relativistic matter with energy density Pm much 


greater than pressure p»/c?; and from relativistic species (including radiation) of 


7 See Sect. 2 for the definition of Rsoo. 
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density p, = 3p,/c?. As a consequence of field equations, the Hubble parameter 
H (a) = d(Ina)/dt = à/a for a model with cosmological constant? A writes: 


1/2 


—— =E(z) = (Qna 340.44 + Oxo ?-E QA) (2) 
with Q; = p;/p. the present-day density of each component i, expressed relative 
to the critical density p. = 3H2/(8xG) ~ 1.9 x 1079? gcm 2, and Ho = H(z = 
0) = 1004 km s^! Mpc^! the present-day Hubble constant. This Chapter discusses 
structure formation at epochs well after the early Universe, one therefore neglects 
the relativistic component density. The curvature parameter Qk = 1 — Qm — Qa van- 
ishes in a flat Universe, although some of the cosmological tests using clusters are 
sensitive to deviations from flatness. 

Equation 2 applies to the Lambda Cold Dark Matter model (ACDM). The for- 
mulation is slightly modified when accounting for a Dark Energy equation of state 
px = wpxc?. The case of a cosmological constant is recovered with w = — 1. Con- 
servation of the energy-momentum tensor implies p;(z) e« (1 +z)? t) for each of 
its components with constant equation of state parameter w;. Furthermore the pa- 
rameter w may vary with redshift; a standard, although not unique, parametrization 
of the Dark Energy equation of state reads w(a) = wo + w1(1 — a). The evolution 
factor in this case writes: 


E(z)* =Qm(1 +z)? - O.(1--z)* +O (1 +2)? 
+Ox(L+z)3U two) e 3wiz/(14-z) (3) 


So far we did not distinguish neutrinos from the other energy components. After 
decoupling from the primordial plasma, neutrinos are relativistic and participate to 
p(z) along with photons, in a relative amount that scales with the effective number 
of relativistic neutrinos species Neff (= 3.046 in the standard model). As Universe 
expands, the temperature of these relic neutrinos decreases as a^ !. Due to them be- 
ing massive, neutrinos become non-relativistic at some redshift znr; past this point 
in time, they contribute to pm(z). An approximate scaling for a neutrino of mass 
my reads: 1+ zy; ~ 2000 x (my /1eV). Their contribution to the present-day matter 
density Qm writes Qyh? = Y (my /93.14eV), with Y; my the (non-zero) sum of neu- 
trino masses. The formula for Qy holds even if one of the three neutrino states is 
still relativistic today [138]. 

The text in this Chapter refers to the usual definitions of distances in cosmology 
[e.g. 96]. The angular distance D4(z) relates the apparent angular size 60 to the 
proper physical size D of an object located at a redshift z, such that D = 60DA(z). 
The luminosity distance D; (z) relates the bolometric flux 5 to the bolometric lumi- 
nosity L of a source at redshift z, such that L — 47D? (z)S. Both their expressions 
involve the evolution factor E (z) introduced in Eq. 2 and 3. 


8 Equivalently, a component of density pa = Ac? / (87G). 
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2.2 Linear growth of matter perturbations 


The growth of matter inhomogeneities that led to present-day galaxy clusters is 
described to good approximation in the weak-field limit of General Relativity; in 
other terms, the Newtonian approximation applies to the evolution of fluctuations on 
scales L «& cH !. The derivation focuses on small departures from the homogeneous 
(‘background’) model in which structures form. The Newtonian potential ® for an 
ideal fluid with mass density p and pressure p with velocity v < c is: 


V2 — 4nG(p +3p/c?) — Ac? (4) 


with F = a(t)x the proper separation between two elements in the fluid and X is 
the comoving separation. The equation of motion is obtained from the geodesic 
equations and reads d*7/dt? = —V;®. Applying this equation to a fixed pair of 
particles in the background (i.e. with x held constant) leads to the cosmological 
background model equation: 


1 d?a 4 j 5 
—— =— TG [pp (t) + 3pp(t)/c*] + Ac*/3 (5) 
a dt 3 

in which index b indicates quantities related to the background homogeneous model. 
Eq. 5 is also a consequence of the Einstein field equations and often is called Fried- 
mann’s equation. In considering the motion of freely moving particles in the ex- 
panding Universe, it is more convenient to form the potential @ = + adix* /2 whose 
Laplacian with respect to coordinate X is: 


V26 = An Ga py(t)Ó (X,t) (6) 


with ô = [p(X,1)/pp(t) — 1] the overdensity contrast at comoving coordinate x. 

The next steps in the derivation involve the description of matter particles as a 
(ideal, pressureless) fluid in the potential @, leading to the matter conservation equa- 
tion and the equation of motion in comoving coordinates. This requires introducing 
the proper velocity ii = ¥+ aX of the fluid, with Y = ax the peculiar velocity of parti- 
cles relative to the background model. Combining those equations and linearizing in 
ô « lI and in v < (L/T) (T being the expansion time of order /Gp;), one obtains: 


à --2HÓ — AnGp,6 — 0 (7) 


In this equation H (z) acts as a damping term. In particular, as expansion accelerates 
the damping timescale becomes smaller than the characteristic time for a pertur- 
bation to grow and structure growth stops. The equation accepts a decaying and a 
growing mode. The latter dominates after some time and the growth factor is D(t) 


such that: 50. i 
= m a NFZ yy 
ó(X,t) oc D(t); D(z) = =z E() j EG 7 (8) 
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The steps in this derivation also lead to the evolution of the linerarized peculiar 
velocity field v; it scales linearly with the quantity f = dInD/dlna, for which a 
convenient approximation exists: 


f ~Qm(a)” ; with y ~ 0.55 and Qn (a) = Osa ?E(a) ? (9) 


This approximation is useful in tests of General Relativity probing the evolution of 
the growth factor. 


2.3 The smoothed linear density field 


Introducing the Fourier transform of the matter overdensity field, ô, the power spec- 
trum of the linearly evolved matter field at any redshift is such that: 


P, (k,z) = |&;|? e« D? (z)T? (k)Po(k) (10) 


Here Po represents the spectrum of fluctuations at early times, that is typically a 
scale-free power law of the form œ K'*, a value n; = 1 indicates scale-invariance?. 
The transfer function T (k) quantifies the evolution of each mode relative to the 
evolution of long-wavelength modes. Its calculation is synthesised in formulas fitted 
onto numerical simulations. Modern codes [e.g. 139, 137] incorporate in particular 
the effect of massive neutrinos. Their impact on the power-spectrum - relative to a 
spectrum computed with massless neutrinos [e.g. 65] — is twofold: i) a reduction of 
the growth rate of cold dark matter fluctuations due to the non-vanishing Qy term 
in the expansion rate (Eq. 2), while their contribution to p, in the Poisson equation 
(Eq 6) remains negligible at small scales!?; ii) a shift in the time of matter-radiation 
equality. A useful approximate formula to gauge the suppression of power in the 
matter power-spectrum at large k > knr is: Pr (K,z, fv) ~ —8 fy x Pr(k,z, fy — 0) with 
fv = Qy/Qm, as long as fy S; 0.07 and z S znr [103]. The typical scale below which 
damping occurs is such that knr ~ 0.026(my/1 ev)! 2917 Mpc^!. 

Smoothing of the linearly evolved density field at a length scale R involves con- 
volution by a spatial filter (or "window function") Wr(x); the variance of the filtered 
density field writes: 


1 ^ 
o^(R,z) = oh = zz | PED Wel fias (11) 


where a hat symbol stands for Fourier transform. The common top-hat three- 
dimensional window function is defined such that it encloses a mass M = 4x R? p, /3 
within the sphere of radius R. This expression enables defining 6 (M,z), the vari- 
ance corresponding to a characteristic mass. The specific value obtained by setting 


9 Cosmological microwave background (CMB) measurements provide ns = 0.965 + 0.004 [193]. 


10 This is due to the large free-streaming length of neutrinos preventing their fluctuations to grow 
on scales smaller than they can travel in a Hubble time tg = 1/H. 
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R = 8h^! Mpc (and z = 0) defines the og parameter which is well constrained by 
galaxy cluster experiments. 


2.4 Departures from linear growth 


In the hierarchical scenario for structure formation, fluctuations grow linearly until 
their overdensity contrast reaches 6 ~ 1 when they start detaching from the expan- 
sion of the background model; their subsequent evolution is driven by gravitational 
collapse. Relevant notations and concepts are commonly introduced with the sim- 
ple picture of a spherically symmetric overdensity that is embedded in a uniform 
background of matter. This spherical collapse model takes roots in early theories of 
cosmological structure formation. 

Spherical collapse equations invoke Birkhoff’s theorem in General Relativity in 
order to describe the evolution of a small perturbation satisfying spherical sym- 
metry in an Einstein-de Sitter Universe, that is a flat matter-dominated Universe, 
in which p,(t) œ 12/3 (Eqs. 2 and 5 for Qn = 1 — Oy = 1). The equations rul- 
ing the evolution of the perturbation are similar to those governing the expansion 
of an overcritical model with perturbed density p = pp(1 + ô). Friedmann’s equa- 
tions lead to R/R = —GM/R? with R(t) the radius of a sphere enclosing a mass 
M = Aapy(1 + 6)R? /3. The solution is such that R grows until it reaches a max- 
imum Rn at given time tm (turn-around time) and then decreases until te = 2tn, 
time at which the structure collapses (R — 0). Due to imperfect spherical symmetry 
and inhomogeneities, complete collapse does not happen. The structure virializes 
at time fyir ~ 2tm and its radius is fixed at the value given by the Virial theorem, 
Ry = Rm/2, that is reached some time before virialization. Computation of the 
mass density within Ryir provides Ayir = Pvir/Pp(tvir) = 1827 ~ 178; this value is 
the typical ‘virial overdensity’ at the time of halo formation in a flat Einstein-de Sit- 
ter Universe. 

On the other hand, the evolution of the overdensity contrast 6 may be lin- 
earized such that Ój, ~ D(t). The value of dj, extrapolated at f = te provides 
ôe = (te) ~ 1.69. This characteristic overdensity contrast can be interpreted as a 
threshold; a structure will collapse if its corresponding linearly extrapolated over- 
density reaches 6, in the course of its evolution. Therefore, it is convenient to com- 
pare the normalized height of peaks 6 / 06 (M) in the smoothed density field with the 
threshold v = ó./6 (M). Hence, at each redshift z there exists a typical mass M" (z) 
describing the scale of collapsing structures; it is such that v(M*) = 1. 

The spherical collapse model is extended to background cosmological models 
deviating from the flat, matter-dominated Universe. Inclusion of a cosmological 
constant leads to small corrections on 6, [117, 253]. An evolving equation of state 
for dark energy translates into small changes in the expression of the spherical col- 
lapse threshold [e.g. 256, 17, 100, 154, 181]. 

A frequent notation for galaxy cluster masses is MA. It corresponds to the mass 
enclosed in a radius RA within which the mean matter overdensity is A. A letter 
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indicates whether the overdensity is computed relative to the background p; (with 
letter ‘b’ or ‘m’, e.g. M200b, R200m) or the critical density p. (e.g. Msooc, R5000), all 
densities computed at the redshift of the cluster: 


4 4 
Mac = z AR cPe(Z)A and Maj = z 7RapPo(z)A (12) 


2.5 The halo mass function and abundance of clusters 


By identifying collapsed halos of mass M with loci in the smoothed linear density 
field passing over the threshold 6,, and with the assumption of Gaussian-distributed 
6’s, Press & Schechter [198] derived a decomposition of the halo mass function in 
the following form: 

dn pp dino! 

am 052 = foy din 
with o = o(M,z) previously defined and dn/dM the comoving volume density 
of halos per mass interval. Notably, f(o) presents an exponential cut-off at high 
masses, with f(6) « &,/o x exp [- 5(&./0)?]. This reflects the observational fact 
that massive clusters are rare entities. Although the original expression for f(o) 
suffers from several approximations (in addition to those inherent to the spheri- 
cal collapse model) that makes it too imprecise for current studies, subsequent nu- 
merical simulations of structure formation showed the adequacy of this decom- 
position in reproducing the distribution of halos in standard cosmologies [e.g. 
229, 108, 238, 190, 52, 60, 47, 162]. In practice, these studies show that f and 
its parametrization slightly depend on cosmological parameters and on redshift z, at 
least to precision levels of order 5%, that are relevant for precision cosmology. 

A more generic formulation for the mass distribution of halos is [52]: 


dn ^ py 1 do ESO (14) 


(13) 


dM Mo?dM Jo 96 Oo 


where .7" (ô, v) is a ‘selection function’ and .Z is the probability distribution func- 
tion of the primordial density fluctuations smoothed on scale M. The .-function en- 
capsulates all effects resulting from the non-linear collapse; in the Press-Schechter 
model that led to the decomposition of Eq. 13, . 7 is the Heaviside function on Óó.. 

The model presented throughout this Section clarifies the origin of the sensi- 
tivity of halo abundances to cosmological parameters. Of them, Qm and og are 
especially well constrained by local cluster surveys as they strongly influence the 
number density of massive halos at late times. The parameter Sg is a combination 
of these two parameters. It is introduced by some authors in an attempt to reduce 
correlations (i.e. degeneracy in constraints) while fitting cluster abundance data. 
Frequently found definitions are Sg = ogQ!, and Sg = 63(Qm/0.3)', with I a fixed 
numerical exponent suited to a specific experiment (typically I = 0.5). 
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3 Galaxy cluster abundances in X-ray surveys 


Galaxy clusters are rare entities, with present-day volume abundance of the order 
1075 Mpc~? at masses above 10!4Mo and 107? Mpc ^? at masses above 10!° Mo. 
Determining precisely their abundance is a key to understanding how structure 
grows in the Universe, what are the processes governing halo formation, and ul- 
timately to constrain cosmological models. Clusters may be counted as a function 
of any of their properties or combination thereof: flux, luminosity, temperature, etc. 
Cluster mass is the most interesting property as it is predicted by the halo forma- 
tion model presented in Sect. 2. At first sight, any count experiment is related to 
the mass function in some way and all abundance functions are a different repre- 
sentation of this same and unique distribution. However there is no such thing as a 
direct cluster mass measurement, and comparing various abundance functions en- 
ables cross-validations of models, extraction of unexpected systematic biases and 
tightening of the mass-observable relations. We select and describe recent progress 
in the counting experiments of galaxy clusters and their derived cosmological con- 
straints. 


3.1 X-ray mass estimate: hydrostatic and proxies 


In addition to tracing the hot gas content, the spatial distribution of gas is sensitive 
to the total mass distribution. The simplest way of solving the dependence is to 
assume that the gas is in hydrostatic equilibrium (HSE), so that acceleration and 
velocity terms in the Euler equation can be ignored. This leads to an equation for 
the encompassed mass within a radius r: 


Muse (< r) = 


kgTr [mne | eer]. is 


Gum, | dlnr ` dlnr 


where T is a temperature of the gas and Pgas - its density, LL - mean molecular 
weight. The systematic uncertainty associated with this technique was calibrated 
using numerical simulations and it is thought to be 10% [219]. These mass estimates 
were used to derive the first estimate of the gas fraction, and to calibrate the cluster 
scaling relations [75, 249, 7, 197]. 

The assumption of gas pressure exactly counterbalancing the gravitational pull 
is subject to numerous investigations. The reader looking for an entry point into a 
large body of literature will find in Pratt et al. [196] a recent review with focus on 
the importance and the impact of an accurate galaxy cluster mass scale on cosmol- 
ogy. Numerical simulations play a central role in the examination of the bias and the 
scatter induced by the HSE assumption. All major simulation results obtained in the 
last decade indicate an underestimate of Mysg with respect to the true total mass of 
a galaxy cluster [see e.g. 82, and references therein]. The amount of bias depends 
on the radial range being considered. HSE-based masses within cluster core regions 
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are generally less biased (at the ~ 5 — 10% level at R2500) than masses within radius 
R59 and beyond, values raising up to ~ 15 — 30% at the virial radius [21]. The bias 
depends critically on the dynamical state of the systems and on the mass accretion 
history of clusters [205]. The assumption of HSE is more likely to fail when ap- 
plied to disturbed systems in contrast to relaxed clusters, and mergers stand out as 
extreme examples of systems for which a simple HSE hypothesis cannot be applied 
blindly. The HSE bias relates to the presence of non-negligible terms in the Euler 
equation for non-viscous fluids. These terms translates into phenomena familiar to 
hydrodynamical fluid studies, once given a smoothing scale to perform spatial av- 
eraging [134]: mean gas rotation and streams, random motions, etc. In particular, 
bulk motions and random turbulent motions are leading the corrections to the HSE 
assumption and contribute via the acceleration and velocity terms. The expression 
for random motions may be recast in the form of a component dubbed 'non-thermal 
pressure’, adding up to the thermal pressure of the gas Pn e PgasT. Simulations in- 
dicate an increase of the fraction of non-thermal pressure ranging from ~ 5 — 15% 
at low radii to 20 — 30% at larger radii, especially in outskirt regions where infall of 
matter on low-density gas regions prevents from reaching HSE quickly [173, 184]. 
Clumping and inhomogeneities in the gas distribution also negatively bias (~ 5%) 
the HSE mass, in that they lead to erroneous estimates of the gas density (and tem- 
perature) profiles from X-ray observations, those quantities entering Eq. 15 [208]. 
Simulations indicate moderate values of the scatter in the Muse — M relation, of or- 
der ~ 10 — 15% at low redshift. The predicted evolution of this scatter is not firmly 
established, some works indicate an increase at earlier cosmological epochs [e.g. 
135] while other point towards constant spread [e.g. 82]. 

On the observational side, results taking as reference weak-lensing-based mass 
estimates in relaxed clusters within R25oọ point towards low (< 10%) bias values. 
This result is consistent with the numerical simulation estimates once allowance is 
made for temperature calibration uncertainties [e.g. 5]. The independent analysis 
of a low-redshift luminous X-ray cluster sample indicates 10 — 1596 lower X-ray 
HSE-based masses within R59) when compared to weak-lensing estimates [231]. 
When considering cluster masses derived from galaxy kinematics as a baseline, a 
bias value of less than 10% is found at high statistical confidence [159]. 

In spite of its limitations, the hydrostatic equilibrium equation (Eq. 15) also il- 
lustrates the origin of scaling between the properties of the gas and the total mass, 
by linking T ~ M (for further details and most recent results on the calibration 
see Chapter "Scaling relations of clusters and groups, and their evolution"). The 
presence of this relation is fundamental to use X-ray surveys to count clusters, as 
parameters of the survey could be linked to the sensitivity towards cluster mass. In 
addition to cluster detection, parameters of the cluster gas can be used to character- 
ize the cluster mass at lower scatter, compared to e.g. weak lensing mass measure- 
ments. These are called mass proxies, with the widely used ones being Lx, T, Megas, 
and their combination, Yx = T Maas. 
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3.2 The X-ray luminosity function 


One of the most studied distribution is the X-ray luminosity function (XLF) of 
galaxy clusters, @(Ly), defined as the number of clusters per unit comoving vol- 
ume and unit luminosity Ly. Constructing such a distribution requires knowledge of 
cluster redshifts, their fluxes and temperatures (for spectral K-correction) and an es- 
timate of a radius, typically Rso9 that comes from a total mass estimate. Moreover, a 
precise determination of the volume probed at each luminosity is key and this links 
directly to the determination of the survey selection function. Rosati et al. [209] 
provided a thorough review of ROSAT-based luminosity functions; at the time of 
their review, data was pointing towards mild-to-no evolution of the comoving space 
density, except that high luminosity clusters were rarer as redshift increases. 

Figure | displays the luminosity functions in the nearby Universe (z < 0.3) es- 
tablished from several new surveys that emerged since then: 160SD [169], WARPS 
[122], REFLEX-I [23], CDFS [78], XMM-XXL [1], SPIDERS [42] and eFEDS 
[145]. When needed, the published luminosity values were converted into the 0.5-2 
keV rest-frame band. There is good agreement between all measurements across 
the entire luminosity range. However, deviations occur near a luminosity ~ 107? 
erg s^!. The apparent deficit of clusters in the 50 deg? XXL survey is found to be 
marginally compatible with cosmic variance effects [179]. The WARPS analysis 
[122] is pointing to an apparent excess of clusters at around ~ 2 x 109 erg s^! ; the 
comparison with REFLEX-II seems to reduce the significance of this effect. 
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Fig. 1 Compilation of low-redshift cluster X-ray luminosity functions from recent surveys. Down- 
ward pointed arrows indicate upper limits. References: WARPS [122]; REFLEX-II [23]; COS- 
MOS+ECDFS [78]; XXLCI [1]; SPIDERS [42]; eFEDS [145]. 
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Figure 2 and 3 shows the luminosity function computed in redshift shells located 
above z = 0.3. This is a major new addition to our understanding of the cluster 
space density distribution. These high-redshift densities are in agreement with each 
other, and no evolution of the comoving space density is observed up to z ~ 0.5 — 
0.6. Beyond z ~ 0.6, the cluster luminosity functions determined by several surveys 
point towards a decrease in the normalisation at low-luminosities, while the WARPS 
analysis [122] indicates no major evolution at the high-luminosity end. 
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Fig. 2 Figure similar to Fig. 1, displaying the X-ray luminosity function at higher redshifts, around 
z œ 0.4. The local REFLEX-II luminosity function is shown as a dashed line. References: 160SD 
[169] and see Fig. 1. 


There are several ways to estimate the XLF from a galaxy cluster sample. A 
widespread technique involves grouping clusters in bins of luminosity, making easy 
the visualization of the cluster population and its evolution. It is worth cautioning 
the reader about biases affecting such ‘binned’ estimators, these biases depending 
on each survey’s completeness limit [e.g. 182]. The XLFs compiled in Figs. 1-3 
originate from their respective publications and these results rely on various estima- 
tors, each has its own bias correction scheme and its own bin-to-bin covariance [see 
179, for a discussion]. 

Cosmological constraints from the XLF evolution deduced from bright cluster 
samples extracted from the RASS led to Qm = 0.28101, Og = 0789.1 for a spa- 


tially flat model with a cosmological constant and Qm = 0240.5. Og — 0.854013, 


w= 1.44104 [149], see also [59]. These results were obtained by studying the 
unbinned (redshift, luminosity) distribution of clusters in the models. These works 
also discuss the impact of various systematics and prior assumptions on their results, 
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Fig.3 Figure similar to Fig. 1, displaying the X-ray luminosity function at higher redshifts, z > 0.6. 
The local REFLEX-II luminosity function is shown as a dashed line. References: see Fig. | and 2. 


for instance the value of the Hubble constant extracted from CMB observations, 
and uncertainties on the theoretical mass function. Notably, the transformation from 
mass to luminosity requires external calibration from local samples [e.g. 207]. A 
very similar analysis based on the identical dataset in addition to the 400d clus- 
ter sample enables constraining the y parameter entering the definition (see Eq. 9) 
of the growth-rate parameter f(z) to y — 0.514016 for flat ACDM model [201]. 
This indicates no departure from General Relativity on cosmological scales. The 
XLF of REFLEX-II clusters (dashed line in Fig. 1) provided preliminary constraints 
Om = 0.29 +0.04, og = 0.77 + 0.08 [23] in agreement with numerous cluster stud- 
ies. The reported amount of systematic uncertainties is highlighting the need for 
more robust scaling relations as the samples grow in size. From the XLF best-fit 
cosmology and uncertainties, Bohringer et al. [24] derive a model mass function for 
clusters in the Universe up to z ~ 0.25; in particular this model predicts that 14% of 
the total mass in the Universe is found within groups and clusters of masses above 
10PM.. 


3.3 The X-ray temperature function 


X-ray spectroscopy provides access to temperature measurements of the intraclus- 
ter medium. The measurements rely on spectral emissivities predicted by radiative 
transfer and atomic codes [e.g. 110, 232] and implemented in widely-used packages 
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such as XSPEC [6] or SPEX [111]. This effort is rewarded by temperature mea- 
surements being nearly independent of a cosmological model, hence facilitating the 
modelling. 

Prior to the advent of XMM-Newton and Chandra, detailed temperature func- 
tions (XTF) have been derived, mainly using ASCA, BeppoSAX and ROSAT data, 
including clusters up to redshift z ~ 0.8 [155, 91, 186, 106, 92]. Complementing 
these observations, theoretical studies highlighted the potential complication due to 
merging systems, for which measurements of the gas temperature and luminosity 
can be significantly boosted [200]. In general, the non-homogeneity of the gas den- 
sity and temperature throughout the intracluster medium has a non-trivial impact 
on the spectra collected in finite angular apertures. Aided by numerical simulations 
and assuming an adequate instrumental transfer function, one can relate the mea- 
sured 'spectroscopic-like' temperatures to the simulated gas properties [160]. 

The exquisite sensitivity, field-of-view and spectral resolution of Chandra, XMM- 
Newton and Suzaku made possible new measurements of the temperature function. 
Using observations with the Suzaku X-ray satellite, Shang & Scharf [226] study the 
local temperature function of a subset of the BCS cluster sample [63]. Following 
Ikebe et al. [106], this work models ICM temperatures with a two-component dis- 
tribution, the cooler plasma model intends to capture strong cool-core emission! !. 
The good agreement obtained with previous derivations reflects both the strong sam- 
ple overlap with earlier studies and the consistency between analyses. The alterna- 
tive determination of the temperature function by Henry et al. [93] accounts for 
selection effects and sample variance among other systematic effects. The theoret- 
ical mass function [108, 238] provides a prediction of the temperature distribution 
of local clusters by means of a temperature-mass relation. This scaling relation is 
a joint fit of sample data — hydrostatic and weak-lensing masses being available 
for a number of objects —, and of hydrodynamic simulation data. Since the sample 
is X-ray selected, effective area calculation requires knowledge of a luminosity- 
temperature relation. The values of cosmological parameters obtained in this study 
are Og (Qm/0.32)” = 0.86 +0.04 with æ = 0.30 (0.41) for O4, < 0.32 (Qn > 0.32). 

Instead of relying on a conversion from mass to temperature, some authors pro- 
posed a construction of the temperature function of clusters derived from the per- 
turbation of the gravitational potential field [e.g. 4]; this is perhaps more intuitive, 
since the gravitational potential is closely related to the temperature of gas particles 
(kT / umy ^: — 9/3) with Po being the non-linear gravitational potential depth near 
the minimum. Such formalism avoids resorting to the mass of a cluster and to the 
mass-observable relations, both requiring calibration. Such routes are promising in 
their ability to bypass intermediate, noisy quantities; however it does not remove the 
need to understand the multi-wavelength selection effects. 


!! A sizeable fraction of cluster thermodynamic gas profiles display ubiquitous deviations towards 
denser, cooler material in their central (R < 0.1R500) regions. The observational definition for cool- 
core clusters varies among authors, involving criteria such as central temperature, density, cooling 
time, entropy. More details are found in the Chapter “Thermodynamical profiles of clusters and 
groups, and their evolution" in this Section of the book. 


16 Nicolas Clerc and Alexis Finoguenov 


In general, temperature measurements require larger integration times than fluxes 
or luminosities; the signal-to-noise ratios also depend on the temperature itself. For 
instance, at fixed number of photons (N X 10?) and with the resolution and sensi- 
tivity of XMM-Newton EPIC cameras, lower temperature gas is easier to measure 
thanks to the presence of emission lines, at the expense of the increased uncertainty 
on metallicity [e.g. 260]. As a result, for reliable temperature measurements of hot 
(kT > 4 keV) clusters, energies below | keV are not used [e.g. 77]. Recently, a sys- 
tematic difference between XMM-Newton and eROSITA temperature measurements 
have been reported [242], while luminosity measurements agree. For these reasons, 
the task of building a temperature distribution for a homogeneous set of galaxy clus- 
ters is more difficult than establishing the XLF. 


3.4 The baryon mass function 


As proposed by Vikhlinin et al. [251], a measurement of the abundance of clus- 
ters as a function of their baryonic mass — that is a fraction of their total mass 
— offers an elegant way to bypass the need for mass-observable scaling relations. 
Since the baryon fraction in clusters should be close to the universal baryon frac- 
tion fp y = Qp/Qm, the theoretical halo mass function trivially relates to the baryon 
mass function through Mp = Y Mig /Qm. The depletion factor Y is close to one 
if clusters are representative of the universal baryon fraction; it is lower than one 
in general. The masses at fixed baryon overdensity 6, must be rescaled by a fac- 
tor Y- (1*9), on top of the f; E factor converting baryonic masses to total masses. 
The exponent œ accounts for the difference between baryon overdensity and total 
overdensity. It is found @ ~ 0.5 for a baryonic overdensity 6 = 324 (relative to the 
mean density of the Universe). Application of this method to a subset of the HI- 
FLUGCS sample [207] put constraints on Q,,4 = 0.13 + 0.07 and og = 0.72 +0.04 
assuming h = 0.71 [252]. The degeneracy in constraints between the parameters 
(Qm, Og) slightly differ from the local XTF analyses, at the expense of introduc- 
ing new parameter dependencies (on h, Q,h* and Y). Studies relying explicitly and 
solely on the baryon mass function are rare in the recent literature, most of the ef- 
fort focusing on joint constraints from gas mass fractions and total mass abundances. 
Caution must be taken when considering a wide range of halo masses: while contri- 
bution of baryons locked in stars is small in clusters (~ 5%), it reaches 30% in low 
mass groups [84, 136]. In addition, the baryons associated with envelope of central 
galaxy, which are notoriously hard to measure, play an important role in the total 
baryon budget [85, 128, 80]. Moreover, the evolution of the baryon fraction is still 
uncertain: one expects high-redshift and low-mass systems might lose baryons as a 
consequence of powerful AGN outbursts. 
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3.5 X-ray observable-space distribution: the logN-logS 


Early studies from the Einstein surveys, ROSAT (deep and all-sky) surveys and 
Chandra Deep fields enabled the measurement of the abundance of clusters as a 
function of their apparent X-ray flux S — usually cast in a cumulative form, the 
logN-logs: 


dn 


oo 
—2) _ / / P Wy 
N(> S)deg ^] = n n(S)dS , with n(S') = xi 


(16) 

It was noted early on that the logN-logS distribution derives rather straight- 
forwardly from the XLF, once proper K-correction factors are taken into account 
(which requires a temperature, redshift and at low temperatures also abundance dis- 
tribution). Therefore, cosmological information from the cluster mass function is 
propagated to the logN-logS, although in a complex form, because of the XLF evo- 
lution with redshift, of the mass-luminosity relation and of the selection function 
[e.g. 27]. 

An observational flux distribution can be derived from survey data once clusters 
are properly identified among X-ray sources and its construction does not require 
redshifts. However, the difficulty is to infer the total flux S of the cluster. To do so, 
one either performs a surface brightness analysis, or one uses the results of deeper 
observations on a similar mass cluster to link the aperture and total flux. Furthermore 
one needs to know the radial extent to which S is measured (e.g. 1 arcmin in XXL, 
Rsooc in CODEX [76], MCXC [189], deep surveys) and the level of point source 
contamination. A statistical correction for the area Q(S) probed at each X-ray flux 
(i.e. the selection function) is needed to account for missing systems in the recon- 
struction of the true logN-logS. For a purely flux-limited catalogue the correction is 
independent on the underlying cluster population. For instance, in a homogeneous 
survey with geometrical area Q, one would write: Q (S) = Q x 6(S — Siim) with © 
the Heaviside function. This formulation assumes that S = S?"5 is the measured flux 
(dependent on the observation, signal-to-noise ratio, etc.), in contrast to the true flux 
$'"'* of a cluster. 

In theory, accounting for flux measurement errors P(S?*|S’“°) and making an 
assumption on true underlying logN-logS'""* of clusters enables to properly correct 
for sample incompleteness: 


dn O(S cree dn 
= d S! rue P gobs st rue 17 
dsesdQ Q D, dS'reqo wj ) (17) 


However, the interesting quantity here is dn/dS'™’ and inversion of this equation is 
not necessarily a straightforward task. One may resort to forward-modelling tech- 
niques [76, 46]; deconvolution is also an option. In practice, such complications 
are often neglected and one may compute an effective Q*f (S'""*) through simula- 
tions of realistic observations. These simulations incorporate a known cluster flux 
distribution, and the effective area Q*f is obtained with the ratio of observed ver- 
sus true number of sources. The same simulations provide a mapping in the form 
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: P(S'""c|S^^5). An approximate method to recover the logN-logS""* from an ob- 
served sample then consists in calculating the logN-logS in terms of the observed 
flux, then resampling true flux values around each S^"* and finally dividing the re- 
sult by Q*f', If measurement uncertainties are negligible and the function linking 
Observed and true flux is invertible, a simple change of variables suffices. 

Another complication arises since more realistic selection functions P(/|S^^, 9) 
call for a model capturing additional cluster population properties 0, in a form that 


reads: 
dn dn 


ds%sdQ d0dO 


In these formulas, the notation P(/|parameters) represents the probability of includ- 
ing a source in a sample given a set of parameters. The vector represented by 0 
may contain random variables 04 that play an explicit role in the detection process 
(e.g. cluster shapes or number of photon hits deposited on the detectors) and random 
variables Om that do not explicitly enter the detection chain. Writing 0 = (04,04), 
we have: 


= [aerais^^. eyes e) (18) 
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Notably this expression does not involve the quantity we are looking for, that is 
the underlying true flux distribution. It may be obtained by calculating n(S'"*) = 
[d0 P(S""*|0)n(0), once the distribution n(0) is determined from Eq. 18 or 19. 
This short discussion sheds light on a difficulty encountered in comparing logN- 
logS derived from different surveys, each using different methods to approximate 
the required corrections. Let us consider for instance a selection depending only on 
the number of cluster-emitted photons hitting the detector in a survey, i.e. 04 = N. 
The correlation between S% and 04 may vanish in an experiment where fluxes 
come from independent follow-up observations; on the contrary, it equals one if 
flux is computed directly from the survey counts (e.g. if S% œ n). With this modifi- 
cation only, the observed flux histograms from these two experiments slightly differ 
because of the two different joint distributions P(S^^*, 04|0,) that enter Eq. 19. For- 
tunately, flux measurement uncertainties and biases are often negligible in practice 
and selection functions depend mainly on S% at first order, making the above for- 
malism somehow too demanding for standard usage. 


3.6 X-ray observable-space distribution: general observables 


The formalism exposed in the context of the logN-logS may be generalized to quan- 
tities other than flux. Most recent cosmological constraints from cluster abundances 
rely on a similar hierarchical Bayesian structure. 
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Taking advantage of the spectral capabilities of XMM-Newton, flux can be mea- 
sured in energy bands of AE X 1 keV in order to form hardness ratios". These bands 
should be narrow enough to capture broad spectral variations and still sufficiently 
large to preserve signal and to remain insensitive to small-scale spectral features. In 
absence of any other information, hardness ratios carry information about the tem- 
perature and redshift of clusters [e.g. 15]. By modelling the distribution of clusters in 
the XMM-Newton count-rate/hardness ratio space — i.e. replacing S% by (CR,HR) 
in Eq. 18 —, Clerc et al. [43] found that cosmological constraints are obtainable 
from a sizeable sample of clusters with moderate individual signal-to-noise data. 
The descriptive power of the observable-space cluster distribution increases with 
increasing number of observables, or dimensions. The process of adding redshift 
information to the flux distribution of clusters is closely related to studying the XLF 
evolution; this was a technique employed to constrain Qm and og from the RDCS 
sample [23]. Adding in the distribution of cluster apparent sizes as measured on 
X-ray images and an extra hardness ratio enables tighter constraints on cosmologi- 
cal and scaling relation parameters [188, 246]. The advantage of such methods lies 
in their ability to provide population models as close as possible to measurements, 
those measurements being devoid of any model assumption. One downside is the 
potentially infinite number of observable parameters to choose from and the high 
computational demand in modelling multi-dimensional distributions. 

Grandis et al. [56] develop a formalism applying to multi-wavelength observ- 
ables and selections. A cluster number count analysis combining RASS (X-ray) and 
DES? (optical) relies on a likelihood function involving the density of clusters per 
bin of X-ray flux (S) and optical richness (A). This density may be rewritten in the 
form of Eq. 19 with 0g = (S?°, A) and Om = (S°, Atrve), 

A framework is proposed by Mantz [151] in the context of performing regression 
on truncated data. It is illustrated with examples specific to modelling galaxy cluster 
scaling relations, when some systems are absent from a selected sample. It shares 
similarities with the formalism presented in this Chapter (in particular Eq. 19); these 
works underline the tight coupling between two apparently distinct tasks, one of 
constraining the cosmological parameters and the other of deriving the cluster scal- 
ing relations. 


3.7 Recent cluster abundance studies 


Vikhlinin et al. [250] present cosmological parameter constraints obtained from 
Chandra observations of 37 clusters with (z) — 0.55 derived from 400 deg? ROSAT 
serendipitous survey and 49 brightest z ~ 0.05 clusters detected in the All-Sky Sur- 
vey. Evolution of the mass function between these redshifts requires Q4 > 0 with 
à ~ 50 significance, and constrains the dark energy equation-of-state parameter to 


1? A standard definition for hardness ratios is HR — H /S with H the X-ray count-rate (or flux) in 
a ‘hard’ band, S in a ‘soft’ band. For details refer to Chapter “X-ray color analysis" in this book. 


13 The Dark Energy Survey covers 5000 deg? in the Southern sky in five optical bands. 
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wo = —1.14 X 0.21, assuming a constant w and a flat universe. Fitting their clus- 
ter data jointly with the latest supernovae, Wilkinson Microwave Anisotropy Probe, 
and baryonic acoustic oscillation measurements, they obtain wọ = —0.991 + 0.045 
(stat) +0.039 (sys). The joint analysis of these four data sets puts a conservative 
upper limit on the masses of light neutrinos ) my < 0.33eV at 95% CL. 

The study of the abundance of massive and bright RASS clusters by Mantz et al. 
[150] improves on previous analyses by resorting to deep X-ray follow-up of a sub- 
stantial fraction of the sample. Their statistical framework distinguishes between ob- 
served quantities and intrinsic quantities and accounts for selection effects and scal- 
ing relations in a self-consistent manner. The addition of 50 weak-lensing mass mea- 
surements (more exactly: shear profiles) among the 224 clusters brings new infor- 
mation on the absolute mass scale of clusters [153]. Although weak-lensing masses 
are individually noisy, they are thought to be of little bias and their inclusion in the 
model leads to stronger constraints on virtually all model parameters — although the 
scatter in individual weak-lensing masses prevents from a large increase in preci- 
sion on parameters sensitive to redshift evolution, such as w. Adding in the (report- 
edly independent) gas fraction measurements of relaxed clusters in the radial range 
0.8 < r/R5soo < 1.2 by Mantz et al. [152], the analysis leads to Qm = 0.26 + 0.03 
and og = 0.83 + 0.04. The study of Mantz et al. [153] explores various extensions 
to the flat ACDM model, in particular it constrains Q4 to 0.73 +0.12 with a strong 
preference for zero metric curvature. The equation of state of dark energy is also 


constrained with good precision (w = —0.99 + 0.06) as well as evolving equation of 
state (wo = —1.04*01. W|— 0.3105. Interestingly, the adopted mass calibration 


does not provide a low value of os, in contrast to other contemporaneous cluster 
studies; this translates into a constraint on } my that is compatible with zero. 

With the help of Chandra data, Schellenberger & Reiprich [218] revised the anal- 
ysis of the HIFLUGCS sample [207] consisting of 64 galaxy clusters and groups 
located at z 5 0.1 and primarily selected upon their high apparent X-ray flux in 
RASS. Measurements of hydrostatic masses and gas masses for all clusters pro- 
vide the basis of the gas fraction test (or fgas, see Sect. 6). This test relies on the 
well-determined value of Q, from CMB data, and on the assumption that baryon 
cluster content is close to representative of the whole Universe baryon content. The 
relationship between the two is parameterized as a function of mass, redshift and 
radius with priors from numerical simulations. This test delivers a posterior confi- 
dence range for Qm; the mass function analysis in turn may take this as a prior in the 
cosmology fit. The slope and normalisation of the luminosity-mass relation are fit si- 
multaneously with cosmological parameters. The best-fit values are Qm = 0.221002 
and og = 0.89 + 0.010 (without fgas priors on Om) and Qn = 0.30 + 0.01 and 
Og = 0.79 + 0.03 (with feas prior). This study describes a number of systematics 
and corrections; among those the extrapolation of mass, gas temperature and den- 
sity profiles for truncated observations and the impact of baryons on theoretical 
mass function. In particular, the inclusion of groups and dynamically disturbed sys- 
tems is found to bias the fit towards low values of Qm. This bias is due either to 
sample incompleteness at low mass, or to a different luminosity-mass relation in 
this regime. Such a modification in the scaling relation possibly originates from the 
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impact of baryonic physics onto observable quantities that differs between clusters 
and groups. 

Focusing on a lower-mass regime, the high-significance subset of 178 galaxy 
groups and clusters in the 50 deg? XXL survey extends out to z ~ 1. The distribution 
of these systems along the redshift dimension is modelled in Pacaud et al. [180] 
and leads to the following constraints: Qm = 0.316 + 0.060, og = 0.814 +0.054 
and w = —1.02 + 0.20. Despite the very different ranges of masses probed by both 
surveys, the ACDM fit (fixing w — —1) agrees well with Planck-SZ cosmology. 
Given the uncertainties it does not show any significant tension with Planck-CMB 
cosmology. The relation between mass and observables — necessary to model the 
selection of objects — involves scaling relations derived from the same sample (or a 
subset thereof). In particular the mass-temperature relation relies on weak-lensing 
mass measurements. 

CODEX [76] show an analysis of the XLF in a ACDM context using 24,788 
RASS sources identified in the 0.1 « z « 0.6 range as galaxy clusters with redMaP- 
Per5.2 run on 10,382 square degrees of SDSS!^ photometry. Under assumption of 
self-similar evolution of scaling laws, and a 5% error achieved on the Ly — M cali- 
bration, and taking into account the survey selection both at X-rays and in the opti- 
cal, they find O4, = 0.270 +0.06 +0.015 and og = 0.79 +0.05 + 0.015. 

A subset of the CODEX clusters confirmed with optical spectroscopic follow-up 
[116] led to precise (Az ~ 0.001) cluster redshifts. These 691 sources benefit from 
optical richness measurements based on DESI? preparation imaging data that serve 
as a mass proxy in the cosmological analysis of Ider Chitham et al. [105]. There, 
X-rays enter only in the selection of the systems; an additional redshift-dependent 
richness selection helps in increasing the purity of the sample and in raising the 
spectroscopic confirmation rate. The resulting constraints read Qm = 0.34+0:09 and 
Og = 0.73 +0.03. Replacing richness by 1-dimensional galaxy velocity dispersions 
as a mass proxy, these read Qm = 0.33 + 0.02 and og = 0.4977 [116]. In the 
latter analysis, a theoretical scaling between the total mass and observed velocity 
dispersion is used, which decreases the uncertainties. 

In comparison of different results, it is important to separate the measurements 
that constrain the same process, where results should be the same, with extrapola- 
tions (for instance, cluster abundance studies are directly comparable to the large- 
scale-structure cosmology using weak lensing shear and galaxy correlation func- 
tion). Recent results by KiDS!6 and DES confirm the somewhat lower values of 
Sg inferred from cluster studies. As an example of possible solutions to a tension 
with Planck-CMB cosmology, Bóhringer & Chon [22] showed that a non-zero neu- 
trino mass may reconcile the REFLEX-II galaxy cluster XLF best-fit og with the 
(2014) Planck-CMB best-fit amplitude of the matter fluctuation power-spectrum; 
although uncertainties in the mass-luminosity relation of clusters prevents from get- 
ting precise constraints on the neutrino masses. Although the required value of the 


14 The Sloan Digital Sky Survey has created a wide map of the Northern sky in five optical bands. 


15 The Dark Energy Spectroscopic Instrument will obtain optical spectra for tens of millions of 
galaxies and quasars. 


16 The Kilo-Degree Survey has mapped 1350 deg? of the sky in four broad-band optical filters. 
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neutrino mass is not supported by the final Planck-CMB results, consistent values 
were reported using a combination of ACT and SPT and WMAP" [61]. Thus, this 
possibility remains open for the future studies, e.g. with new CMB experiments. 

As illustrated by Eckert et al. [64], X-ray emission is a low-scatter mass proxy at 
~ 0.5R500c. The scatter increases towards R5oo. and it is not well known beyond it. 
Mulroy et al. [170] showed that the scatter in Compton Y parameter depends on the 
resolution of the measurement, as inclusion of the zone outside of R50oo. in Planck 
measurements, leads to larger scatter. X-ray morphology studies have been used to 
separate out regular clusters. Cluster detection based on the concentration of red 
sequence galaxies using comparable radii is only possible for rich end of the cluster 
mass function [51]. Spectroscopic selection of galaxy groups often uses Rooc, in 
order to improve the sensitivity, which leads to a large variety of dynamical states 
of the detected objects, not all of which are containing massive virialized halo. 


4 Clusters as tracers of large-scale structure 


Itis well-recognized that collapsed halos cluster in a different manner from the un- 
derlying matter field [e.g. 113]. A simplified understanding may be gained by con- 
sidering the superposition of two random fields fluctuating on much different spatial 
scales. The slowly varying random field S is supposed to have smaller amplitude. In 
terms of typical variances, 0; < s hence S ‘modulates’ the rapidly evolving field 
N. In absence of modulation, objects form at locations where dy/oy > v and the 
probability for an object to form, Py(> v), is given by the properties of the noise 
field. In presence of the field S, the resulting overdensity field is 6 = ôs + dy and 
the probability for an object to form at location X is spatially modulated; it writes: 


P(» v,X) ~ Py(> v — ós/ ow) ~ Py(> v) [1 + Bés(X)] 


The second equality is a first-order Taylor expansion with B being expressed in 
terms of Py, its derivative, and c. Hence, the overdensity contrast of objects at loca- 
tion X relative to the average density of objects writes 6), (X) = [P( v, X)/ P (> v)|— 
1 ~ Bds(X). The ratio between the object and ‘background’ overdensity fields, 
0, / s = B translates into a "bias" factor B? in the ratio of spatial correlation func- 
tions of objects and of the field S, as is clear from the definition of the correlation 
function €: 


Enn (r) = (8, (X) En (X - 7) c B^ (Ss (X) 8s (X - 7)) = B'Ess(r) 


17 The Atacama Cosmology Telescope, the South Pole Telescope and the Wilkinson Microwave 
Anisotropy Probe provide maps of CMB anisotropies, either from ground or from space. 
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4.1 Two-point clustering of halos and the bias parameter 


Beyond this illustrative picture, the definition for bias is well established in the case 
of random fields representative of the large-scale matter distribution [e.g. 12]. On 
spatial scales large enough to be insensitive to the non-linear evolution of the density 
field, the ratio between the halo and matter power-spectra defines the bias parameter 
D?. The ‘peak-background split formalism’ provides an useful means to estimate the 
variation of bias with object mass [e.g. 12, 163, 229]; it involves knowledge of the 
mass function of halos conditional on the background density and a prescription for 
relating Lagrangian to Eulerian overdensities [e.g. 264]. Calibration of the large- 
scale halo bias using numerical simulations allows capturing departures from the 
peak-background split model and accurate fitting formulas provide state-of-the-art 
values for the bias. These are usually given as a function of mass, b(M), or as a 
function of peak height, b(v), for several mass definitions and overdensity thresh- 
olds. Two such formulations for the large-scale halo bias are brio [239] and bpii 
[20], expressed as: 


a 


brio(v) 21—A Bv +Cv° (20) 


V 
v4 + 6¢ 
av? —q 2p/ó. 


a NET 


(21) 


In both formulas, v = 6,/o0(M,z) is related to a halo mass M4. Eq. 20 is calibrated 
on N-body simulations with (a,b,c,A,B,C) being dependent on the overdensity A 
and independent on redshift [239]. Eq. 21 derives from the peak-background split 
formalism applied to the halo mass functions established in Bhattacharya et al. [20]; 
the fitting parameters (a, p,q) therefore intervene in the expression for f(o) (as 
defined by Eq. 13) and may [20] or may not [47] depend on redshift. 


4.2 Constraints from X-ray clusters two-point clustering analyses 


Due to the sensitivity of the clustering amplitude of dark matter halos of a given 
mass to the underlying cosmology the application of the clustering theory to galaxy 
clusters is theoretically highly motivated. Moreover, galaxy clusters exhibit scaling 
relations between their baryonic properties and the total mass of their hosting halos. 
These properties include among other the cluster X-ray luminosity and richness (that 
is, number of galaxies belonging to the cluster). Using on one hand these observable 
quantities as mass proxies and on the other hand the connection between cluster 
masses and their bias one can make inference on cosmological model. Most attempts 
to follow this route have, however, resulted in cosmological parameters, which are 
in discordance with the constraints obtained using the number counts of the same 
sample [223] or lead to strongly disagreeing scaling relations [109]. Remarkably, 
[3] obtain a precise agreement between the modelling of the clustering, based on 
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the weak lensing mass calibration and the measured clustering amplitudes of galaxy 
groups, by taking a detailed account on the definition of the object and the effects 
of sample variance. 

The power-spectrum analysis of the REFLEX-II sample [23] is presented in 
Balaguera-Antolinez et al. [10] with a comparison and validation supported by large 
mock catalogues mimicking the original survey selection. The X-ray luminosity- 
dependent clustering bias of clusters is modelled both in redshift- and real-space by 
means of a mass-luminosity relation calibrated such as to recover the REFLEX-II 
XLF; the effective bias is an average weighted over the mass distribution of clus- 
ters. Covariance matrices are obtained from the mock catalogues, highlighting the 
strength of off-diagonal terms due to mode-coupling, especially at small scales. The 
systematic distortions of the power-spectrum induced by the flux-limited nature of 
the sample are potentially important limitation in the analysis of the REFLEX-II 
sample, akin to Malmquist bias in abundance studies [e.g. 222]. The analysis of the 
mocks indicate negligible effect in the range of wave numbers of interest. The am- 
plitude of the power-spectrum derived from subsamples selected upon varying lumi- 
nosity thresholds shows good agreement with predictions from the ACDM model. 
While Baryon Acoustic Oscillations (BAO) signatures are not expected to be distin- 
guishable given the obtained signal-to-noise ratios, the shape of the power-spectrum 
hints for departures from the linear structure growth at intermediate scales. 

Despite its relatively small sky coverage, the increased sensitivity of the 50 deg?- 
wide XXL survey probes low cluster masses (median Msoo ~ 10/^M.) at high 
redshift (median z ~ 0.3). The clustering analysis of 182 high-significance X-ray- 
detected systems in both the Northern and Southern halves of the survey [158] con- 
strains Qm and the effective bias of the cluster sample beff, to 0.274006 and 23:0 
respectively. The latter constraint matches well the bias predicted in the ACDM 
model, the mass of XXL clusters being derived from the weak-lensing M,,; — Tx 
relation as measured from a subset of those systems. Limited statistics prevent from 
fitting additional parameters, as well as marginalizing over uncertainties in the mass- 
observable relations. The constraints from the XXL survey ultimately should follow 
from a combination of abundance and clustering analyses [187]. 

The analysis of the CODEX galaxy clusters [144] aims at predicting the clus- 
tering bias of these systems based on their masses. All three components in the 
analysis — the clustering of galaxy clusters (or dark matter halos in the case of simu- 
lations), the expected dark matter distribution and the mass-to-bias conversion — de- 
pend on the assumed cosmology. Bringing them into agreement therefore provides 
a test of the cosmological model. A parallel analysis of the halo catalog built from 
a (4 h! Gpc)? dark matter simulation [120] shows the ability of this approach to 
recover Qm and 6g within statistical uncertainties. The likelihood involves a com- 
parison between the two-point correlation function of halos and the total matter 
power-spectrum rescaled by the bias factor, as predicted from the mass of the sys- 
tems. CODEX cluster masses are estimated either from their X-ray luminosities or 
their optical richnesses; these estimates predict bias factors in agreement with the 
measured value at the level of 17-39%, with richness-based estimates providing bet- 
ter agreement. Splitting the sample in redshifts bins leads to predicted bias values 
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agreeing with the measured values at the 5-17% level, depending on the redshift 
range. The clustering analysis in redshift bins provides the following constraints: 


Qn = MP e and og — Usa Additional systematic uncertainties of +0.02 


and +0.19 respectively, arise due to survey effects and bias modelling. Combining 


the clustering-based likelihood with one from the cluster mass function the follow- 


ing constraints follow, also represented in Fig. 4: Qm = 0.27 00) (stat.) +0.07(syst.) 


and og — 0.79* 009 (stat. ) + 0.04(syst.). The consideration of covariance errors 
makes an estimate of their contribution below the statistical error of this analysis, 
while this results shows a clear dominance of systematics, within which no differ- 
ence to Planck-CMB cosmology could be claimed. 
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Fig. 4 The likelihood function of (Qm, Og) from the cluster two-point correlation function (blue), 
X-ray luminosity function (red) and joint distribution (green). The light and dark contours are the 
68 % and 95 % confidence regions, respectively and red dashed lines show the best fit values for the 
joint likelihood. Credit: Lindholm et al., A&A 646, A8, 2021, reproduced with permission OESO. 


Forecasts for large cluster survey experiments combine the abundance and clus- 
tering into a powerful probe of cosmological models. Of specific interest there 
are departures from Gaussian initial fluctuations'*. A popular, albeit not unique, 
parametrization of primordial non-Gaussianities involves the dimensionless param- 
eter fur quantifying the amount of quadratic corrections added to the Gaussian po- 
tential; it relates to the skewness in the distribution of the overdensity contrast 6. In 


'8 Although recent results from Planck-CMB rule out high levels of departure from Gaussian initial 
perturbations [194]. 
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a work addressing X-ray surveys, Sartoris et al. [216] predict constraints on primor- 
dial non-Gaussianity involving both the power-spectrum and number counts from 
samples of sizes > 10°. This study demonstrates their complementary strengths, 
particularly in breaking degeneracy between fnt and og. The specific case of the 
eROSITA all-sky survey, promising (10?) clusters, is studied in Pillepich et al. 
[191]. Several scenarios are investigated; the most favourable one involves X-ray 
follow-up observations of a subset of objects in order to constrain the luminosity- 
mass relation at a level beyond that currently known. In this case, the expected one- 
sigma uncertainty on Qm, Og, wo and w; amount to +0.006, +0.008, +0.07 and 
+0.25 respectively, after combination with complementary cosmological probes 
(CMB, BAO, Type Ia Supernovas). 


5 Sample variance considerations 


Sample variance originates from observations of finite volume of the Universe and 
the fact that large-scale structure is spatially correlated. Counts or clustering ob- 
servables extracted from surveys of any size are affected by this additional term, 
regardless of the mass of the tracers [e.g. 163]. 


5.1 Variance in cluster number counts 


The relative sample covariance OF; between two estimates of galaxy cluster number 
density n;(Mi,z;) measured in two volumes described by their window functions 
W;(X) such that f drdQW; = 1 may write [104]: 


(ninj) —ñiñj — au un : dk SN eee 
96; — omm o PAM ze) (Mj, DD) J Grp 0 ae 


where ñ is the average density of clusters, P(K) is the linear matter power-spectrum 
today, D(z) is the linear growth-rate as a function of redshift, b(M,z) is the linear 
bias parameter for objects of mass M at redshift z and three-dimensional Fourier 
transforms are annotated with a hat. The additional variance term due to cluster 
counts following a Poisson distribution is usually denominated 'shot-noise vari- 
ance'; it adds quadratically to sample covariance. Hu & Kravtsov [104] consider 
the rather general case of a survey window function that is separable into an angular 
mask and a selection varying with the radial direction. For number counts in two 
radial bins indexed by i and j in the local Universe, Eq. 22 reads: 


nin;)— n dk ~ E ~ 2k P(k 
mn -b yan / g FIOR AQ) |Orm| D Q3) 
l,m 
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where window functions are such that W; = R;(r)O (0, ). Rj(k) is the Bessel trans- 
form of the radial window, ©,,, is the harmonic transform of the angular mask. Sim- 
ilar equations found successful application to galaxy surveys [e.g. 174, 241, 166]. 

For local all-sky cluster surveys comprising a single mass and redshift bin and 
extending out to a radius R, the variance term reduces to multiplying 5? by Ge the 
variance of the linear matter field smoothed over a radius R. The scaling of sample 
variance with the survey volume V « R? depends on the shape of P(k). For nearly 
flat power-spectra, sample variance thus scales as 1/V. Shot-noise relative variance 
scales as 1/AV, therefore for such survey configurations, the relative importance of 
sample variance and shot-noise depends only weakly on survey volume. For a given 
local survey volume, the relative importance of shot-noise increases with increasing 
mass selection threshold, due to the rarity of high-mass clusters. 

An example use of the formalism is provided in Fig. 5 assuming the Planck-CMB 
cosmology [192]. Focusing on the number “M (> M") of clusters per solid angle in 
Az = 0.1 bins above a certain mass threshold, the diagram displays the loci of sur- 
vey configurations for which the uncertainty on .V is dominated by shot-noise or by 
sample variance. For a given redshift bin, deep surveys corresponding to low thresh- 
old masses provide large amounts of clusters; for those surveys sample-variance 
considerations are relatively important. The boundary between the two regimes de- 
pends on the area covered — here a spherical cap geometry was assumed in our 
calculation. The location of these boundaries additionally depends on the binning in 
redshift and on the upper mass selection threshold. 

Formulas such as Eqs. 22, 23 may be implemented with the help of a library such 
as COLOSSUS [62]. As an application, we find the following empirical fit for the 
ratio between the sample variance and shot-noise variance in the sky number density 
WN (units deg?) of clusters above a given mass Miim p in a narrow bin of redshift 
Az S 0.1: 


Ov Rd HE 3 ae | x 0.4049 (24) 
Oi  1.5x10-9(h-!Mpc) 4 

This crude approximation holds for a continuous survey coverage of area greater 
than 1000 deg?, at redshifts 0.1 X z <2 and 10 Mo < MIS, <few 10!4Mo ina 
Planck-CMB cosmology. The halo bias b(z) is averaged over the population of clus- 
ters satisfying M > M!" and the volume density of halos n has units (h^! Mpc) 2. 

Accounting for a mass- and redshift-dependent cluster selection function in 
Eq. 23 involves considering the volume probed by the survey at a given mass in- 
dex; the radial bins then consist of overlapping, increasingly large spheres. Hu & 
Kravtsov [104] examined the specific case of a local, flux-limited X-ray survey with 
R = (13-z) ! / Lx (M)/ (Az fiim), with Ly (M) the scatter-less mass-luminosity re- 
lation. In practice, X-ray cluster selection is not similar to a mass threshold and the 
derivation of sample variance should be reassessed with the exact selection cuts. For 
surveys extending further out in redshift, the relative importance of shot-noise vari- 
ance and sample variance depends on the sky coverage, the mass selection threshold 
and the maximal redshift. Shot-noise is found to dominate at high redshift where 
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Fig. 5 Estimating the relative amount of sample variance and shot-noise in galaxy cluster count 
experiments in broad redshift bins. Any point in this diagram corresponds to a limiting mass Mim 


at given redshift z. The curves delineate the boundary between regions in this diagram where sky 
number densities in Az — 0.1 bins are shot-noise dominated (above) and sample-variance domi- 
nated (below). Each curve is calculated following the equations in Valageas et al. [245] and assum- 
ing a ‘spherical cap’ survey geometry covering a solid angle AQ. As an example, the variance in 
the number density of M > 101^n-! Mo clusters satisfying 0.4 < z < 0.5 (grey cross) is dominated 
by shot-noise in a 1 deg? survey; but it is dominated by sample-variance in a full-sky survey. 


massive halos are rare. For the specific cases of XMM-XXL and the eROSITA all- 
sky survey the transition redshifts between the sample-variance-dominated and shot- 
noise-dominated number count experiments in Az ~ 0.1 bins are z ~ 0.7 and z ~ 0.5 
respectively. 


5.2 Extensions of the sample variance formalism 


The analytic formalism of [104] is extended beyond number counts by Valageas 
et al. [245] to the sample variance of the halo correlation function £ (r), for various 
estimators including the Landy-Szalay one; the effects of redshift-space distortions 
discussed in Valageas & Clerc [244] are of tiny impact. 

Takada & Hu [235] dubbed 'super-sample covariance' the effects due to sampling 
the power-spectrum of the matter in a finite Universe volume. In their proposed 
approximation, the modes with inverse wave numbers exceeding the survey size act 
in a similar fashion as a uniform rescaling of the density background. Calibration of 
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the power-spectrum response to a perturbation 6, of background density provides 
a term additive to the power-spectrum covariance. In the frame of the halo model, 
halo sample variance appears as a limiting case of super-sample covariance in the 
strongly non-linear regime. A concise and generic formulation for 'super-sample 
covariance’ between two observables ?, and 65 collected in two redshift bins is 
[e.g. 132]: 


001 fa) 
Covssc (@1, @2) = fav, dV; = 


ae 1) 5g, @ z2) oss (21522) (25) 


where the integral runs over two volumes defined by the redshift bins and the an- 
gular window functions. The term 6?(z1,22) is the covariance of the background 
density fluctuation (ô, (z1 )ôp(z2)), related to P(k) and to the growth rate D(z). The 
comoving density of the observable, o;, reacts to a change in background density 
as doj/0 0j. Specifically for cluster number counts experiments, Ê; = N(iy,i;) and 
€» = N(jm, jz) are the numbers of clusters in two mass and redshift bins indexed 
by (iy, i;) and (jy, jz). Thus o1 =n(iy,z) and o2 = n(jy,z) are the densities of 
clusters in those mass bins, and the following holds: 


on d 
38, 2 = fi, OM Gg Pc) (26) 


Méiy 


which is the average of the linear halo bias weighted by the halo mass function 
dn/dM. Following this formalism, Lacasa et al. [131] demonstrate that for large 
redshift bins (of size Az ~ 0.1), the formula in Eq. 22 underestimates the sample 
variance by ~ 30%. In fact, the derivation in Hu & Kravtsov [104] implicitly as- 
sumes non-evolving bias within a window function support domain. Both descrip- 
tions (Eq. 22 and 25) therefore coincide in the limit of small redshift bins. Correla- 
tions between redshift bins that are large compared to the angular size of the survey 
can be ignored and this leads to a computationally simpler expression [245, 126]. 

Since galaxy cluster bias b is a prediction from the cosmological model, the vari- 
ance in cluster abundance across multiple patches of the sky encloses constraining 
power. With simple assumptions on the sample selection mechanism and assuming 
a low (known) scatter in the mass-observable relation, Lima & Hu [142] show how 
full accounting for sample variance effects can break degeneracies induced by a 
mere number count analysis. Their likelihood represents a joint fit of cosmological 
parameters and of the mass-observable relation. Information on the evolution of the 
scatter and bias can be retrieved at the expense of additional binning of the data 
with a mass proxy [143]. Covariance among count-in-cells also probes primordial 
non-Gaussianity of the local type thanks to the strong scale dependence of the halo 
bias and the coupling of cells on scales of hundreds of Mpc [175, 53]. 

Sample variance is also a concern for numerical simulations aiming at reproduc- 
ing the surveys under study. Klypin & Prada [119] estimate that multiple simulations 
boxes with sizes ~ (1 —1.5)h~! Gpc are suited to most cosmological studies, includ- 
ing galaxy cluster abundance studies, without significant loss of power. Running 
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such a large volume with full hydro-dynamical simulation represents a substantial 
investment of computing resources. 


6 Clusters as standard candles 


Cosmological constraints often rely on standardizable probes to perform cosmogra- 
phy or evolutionary experiments — a typical example being Type Ia supernovae and 
their use as standard candles. Galaxy clusters enter this class of cosmological studies 
in complementing the tests relying on population distributions described previously. 


6.1 The gas fraction tests 


In Sect. 3 we presented a cosmological test based on the abundance of clus- 
ters as a function of their baryonic mass. Similar arguments provide support to a 
quite different and powerful test that consists in measuring the gas mass fraction 
(feas = Mgas /Meot, the ratio of gas and total masses) in galaxy clusters, and in relat- 
ing this value to the Universal fp y = Qp/Qm prediction. Unlike abundance studies, 
truncation of samples due to selection effects is less of a concern; the main difficulty 
consists in choosing systems for which only small departures from spherical sym- 
metry and hydrostatic equilibrium are expected. They should be massive enough 
for their matter content to be representative of the whole Universe and for hav- 
ing a reduced stellar fraction. Under such conditions the universal baryon fraction 
should approximately match fas with a small amount of cluster-to-cluster variance. 
With robust priors on Qy/? and the Hubble constant, the application of this test 
at low redshift provides tight constraints on Qm [see e.g. 259, for an early appli- 
cation]. Furthermore, assuming a limited evolution of fgas with time (as indicated 
by numerical simulations incorporating cooling and AGN feedback) serves in con- 
straining parameters governing the evolution of the luminosity and angular diameter 
distances, hence the cosmological parameters entering H(z). Subsequent to the re- 
sults and analyses of Allen et al. [2] and Ettori et al. [68] putting constraints on Qm, 
Q; and w, Mantz et al. [152] presented a new sample of gas fraction measurements 
in 40 massive and hot clusters extending out to redshift z — 1.1. A noticeable fea- 
ture is the systematic selection of dynamically relaxed clusters based on their X-ray 
appearance in deep Chandra images. Gas fraction measurements in a radial range 
[0.8, 1.2]R2500 constitute the observable that is compared to the following model 
[152]: 


Be (0.8 < r/R < 12) = K(z)A(z)Y(z) (2) [e k (27) 
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where d(z) is either the luminosity (Dr) or angular (D4) diameter distance, K = 
Ko(1 + Kız) represents the ratio between total and X-ray mass, modelled with a 
linear dependence with redshift. A accounts for the shift in radial range with chang- 
ing cosmology, Y = Yo(1+Y1z) is the gas depletion factor scaling linearly with 
redshift. The superscripts "ref" indicate the values for the reference cosmology 
fixed in the process of measuring fgas. In the Mantz et al. [152] analysis, lens- 
ing masses for 12 clusters provide constraints on Ko, as all cluster masses are 
derived after deprojection of temperature and density profiles under the assump- 
tion of hydrostatic equilibrium. Numerical simulations [195, 16] give the prior on 
Yo. The analysis assumes uniform priors on Kı and Y; centered around zero. The 
low-redshift data leads to tight constraints on I?! 205 /Qm. Once combined with 
local Hubble constant measurements and Big Bang Nucleosynthesis, these trans- 
late into Qm = 0.27 + 0.04. For non-flat ACDM models, the full sample provides 
Qa = 0.654937 and Om = 0.29 + 0.04. Models considering the equation of state of 
dark energy in flat ACDM are constrained with good accuracy thanks to the redshift 
leverage arm in the sample and a reduced amount of systematic uncertainties in the 
derivation. 

Modifications or extensions to the model above (Eq. 27) can be exploited to test 
alternative cosmological scenarios. The analysis of Magaña et al. [148] combines 
feas samples established in X-ray analyses [2] and in SZ [90] to examine various 
models for the equation of state of Dark Energy. Some of these models tend to 
marginally indicate a slowing down of cosmic acceleration at late times, although 
this result vanishes as CMB, BAO and Hubble constant measurements are included 
in the analysis. Li et al. [140] argue that f (R) theories of gravitation induce a change 
in the relation between gas temperature (hence, the inferred X-ray hydrostatic mass) 
and the true cluster mass in comparison to standard gravitational theory. This change 
should lead to a modification of the relation between fgas and Qp / 9; their analysis 
of the Allen et al. [2] sample thus excludes strong departures from general relativ- 
ity. Focusing on departures that break the duality relation between luminosity and 
angular diameter distances (see below), Holanda et al. [98] use the sample of fgas 
by Mantz et al. [152] in combination with temperature measurements of the CMB 
at various redshifts in order to establish consistency with general relativity. 


6.2 Distance measurements with combined X-ray and SZ 
observations 


The inverse Compton scattering of photons constituting the cosmological back- 
ground radiation (CMB) by energetic electrons in the intracluster medium distorts 
the blackbody CMB spectrum. This process called the Sunyaev-Zeldovich (SZ) ef- 
fect imprints characteristic AT /Tcwg ~ 10^? secondary temperature fluctuations in 
the CMB sky observed at frequencies around v ~ 100 — 300 GHz, typically over 
arcminute angular scales corresponding to a cluster apparent extent [234]. In the 
Rayleigh-Jeans part of the CMB spectrum, the temperature decrement imputed to 
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the thermal SZ effect reads: 


kT, 
CMS J orneTdl (28) 


ATcmp œ —2 

MeC 

with or the Thomson cross-section, 7; the electron temperature and the integral is 

performed along the line of sight. The dependence on n, and T, in the integrand 
differs from the X-ray surface brightness dependence (Eq. 29). 

A cluster of typical size .Z along the line of sight imprints a SZ decrement scaling 
as AT /T ~ .Z/n,T, (a bar indicates averaging along the line of sight). On the other 
hand its X-ray surface brightness scales as Sy ~ Ln2Agoo1, With Acoo; the X-ray 
cooling function, dependent mainly on gas temperature and elemental abundance. 
Assuming spherical symmetry of the intracluster gas, & can be obtained from a 
measurement of the cluster angular extent A0; cancelling n, in both expressions 
thus provides an estimate of the angular diameter distance at the redshift of the clus- 
ter, DA(z) = .Z/A8, that can be compared with the prediction of the cosmological 
model. 

The argument presented here in a simplified form supported early attempts 
at combining X-ray and millimeter-wavelength observations of galaxy clusters, 
with the goal of retrieving the Hubble constant Ho and the deceleration parame- 
ter qo — — da /à)| (o [e-8. 230, 37]. The assumption of spherical symmetry may 
be dropped by performing sample averages and the test can incorporate more so- 
phisticated gas shapes. Consequently this test is also sensitive to the distribution 
of cluster shapes in a sample; by this means De Filippis et al. [56] discovered 
a bias in a sample of 25 X-ray selected clusters being preferentially elongated 
and aligned along the line of sight. Under the assumption of spherical symmetry 
and hydrostatic equilibrium of the gas, Chandra X-ray data and resolved observa- 
tions of the SZ-effect in 38 clusters at redshifts 0.1 « z « 0.9 lead to the value 
of Ho = 76.0127 (stat.) urs (syst.)km s7! Mpc^! for a ACDM cosmology with 
Qm = 0.3 and Qa = 0.7 [26]. 

The expression for the X-ray surface brightness can be rewritten with an explicit 
dependence on the luminosity and the angular diameter distances: 


2 
n^(z) 2 
Saga | t PT 


with n (z) = Da(z)(1 +z)? /Dz (z). It makes apparent the result of the X-ray/SZ test 
being a measurement of D4/n? rather than D4. It thus provides a check of the 
*distance-duality relation’, that is an observational test of the equation 1] = 1 pre- 
dicted by the standard cosmological model [243]. To this end, an estimate of the 
distance luminosity at the redshift of the clusters is necessary. Assuming WMAP-1 
cosmology, the sample of X/SZ distance measurements of Reese et al. [206] shows 
no significant violation of the 7 = 1 relation [243]. This results holds if instead of 
assuming a cosmological model, supernova distance moduli measurements provide 
Dj (z) in a model-independent fashion [e.g. 97, 141]. 
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6.3 Recent results on the Hubble constant measurements 


The combined analysis of Planck SZ-effect measurements and XMM-Newton data 
for 61 clusters at z < 0.5 leads to Ho = 6743kms~! Mpc! [124]. The analysis 
takes advantage of resolved SZ and X-ray data to jointly model the pressure, den- 
sity and temperature profiles in clusters. The ratio between the pressure profiles 
derived from each observable is decomposed in two terms. The first term involves 
cosmology dependencies, among them H and the helium abundance; the second 
term incorporates what is described in the next paragraph as systematic uncertain- 
ties, i.e. clumpiness, asphericity, etc. The latter component is constrained via the use 
of hydrodynamic simulations [204] providing a quantitative prior on its distribution. 

An independent analysis of the pressure profiles of 14 dynamically relaxed clus- 
ters [selected following the criteria in 152] combined with Planck and Bolocam S-Z 
data provides Ho = 67.31713 kms~! Mpc™! [255]. The apparent larger uncertain- 
ties originate from inclusion of extra freedom in the analysis, aiming at characteriz- 
ing the (currently uncertain) temperature measurement bias specific to the Chandra 
calibration. This analysis demonstrates such a calibration uncertainty is the major 
source of error in their determination of Ho. Reversing the argument and assum- 
ing Ho is known from CMB data or from Cepheid variables, the sample is used to 
constrain the temperature measurement bias. 


6.4 Sources of systematic uncertainties 


The fgas tests resort to galaxy cluster total mass measurements. Cosmological results 
are therefore sensitive to the biases and uncertainties in those measurements, such 
as the hydrostatic bias introduced in the discussion of Eq. 15. Minimizing the bias 
and scatter around true mass values may be achieved by carefully selecting morpho- 
logically relaxed and isolated clusters, those systems being more likely to display 
low mass accretion rate and gentle dynamical disturbance. Calibration of measured 
masses against numerical simulations provide expectations and prior values for the 
deviations from truth. 

Clumping of gas on scales smaller than the resolution of the instrument leads 
to an enhancement of the X-ray emission, leading typically to a positive bias in 
the value of Ho inferred from the X/SZ test. Finoguenov et al. [77] looked at the 
source of clumping suggesting that pressure fluctuations contribute only 5% and the 
main source is entropy fluctuations. The latter can be removed in X-ray images us- 
ing the choice of the special band from imaging [40]. Systematic uncertainties in 
the galactic absorbing column density and the absolute calibration of the tempera- 
ture measurement are small, but relevant effects to consider when using clusters as 
standard candles [e.g. 26]. 

An additional source of uncertainty stems from the gradient in helium (He) abun- 
dance within the cluster. Numerical simulations indicate moderate helium sedimen- 
tation in the low-temperature outskirts of clusters, however core regions are poten- 
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tially richer in helium due to the drift of higher-mass atoms toward cluster centres 
[e.g. 185]. Not accounting for such gradients may induce biases in the recovered gas 
densities in central regions, due to X-ray emissivities being function of the abun- 
dances of all atoms, including He [e.g. 199, 33]. This translates into a systematic 
uncertainty on the measurement of Ho, fgas, total masses and thus on cosmological 
parameters. As an example, Peng & Nagai [185] predict up to 10% bias on Ho from 
the X/SZ test performed within R2500 if helium sedimentation is unaccounted for. 
Reversing the argument, prior knowledge on the Hubble constant and cluster shapes 
enables constraining the fraction of Helium in central cluster regions [156]. 


6.5 Distance measurements from spectra of X-ray resonant lines 


The X/SZ test presented above involves measurements of the cluster size along the 
line-of-sight, specifically a quantity proportional to “7, that is the SZ Compton-y 
parameter. A similar observable can be constructed in the X-ray domain, considering 
the optical depth T ~ Zm; of resonant lines in high-resolution spectra of X-ray 
light passing through the intracluster medium. While most lines in the ICM X-ray 
spectrum are optically thin (t < 1), some strong lines show optical thicknesses t = 
1, such as Fe Xxv at 6.7 keV in big clusters. Incoming light at the line frequency is 
absorbed and reemitted in a random direction. Resonant scattering thus modifies the 
shape, decreases the amplitude [e.g. 227, 83, 41] and possibly imprints polarization 
[e.g. 217] of these lines and at the same time it leaves intact non-resonant lines. 

Measurements of the optical depth may involve resonant absorption lines in the 
spectra of light passing through the galaxy cluster and emitted either from a back- 
ground AGN [129] or — most likely — from an AGN hosted in a cluster core [215]. 
This technique involves comparisons of on-target (AGN+ICM) observations with 
off-target (ICM only) emission line measurements to estimate the optical thickness 
and, in turn, the angular diameter distance to these clusters and constraints on the 
Hubble constant and its evolution. 

Alternatively, resonant emission lines originating from the ICM itself may be 
used in measuring the optical depth. Molnar et al. [164] estimate a few thousand 
photons in the ICM iron 6.7 keV resonant line at 2 — 4 eV spectral resolution lead 
to a 5%-error measurement on T, provided 7 > 2; their calculation predicts a 10% 
statistical uncertainty in the measurement of D4 (z) from 100 clusters at 0 < z < 1, 
which translates into a 16% uncertainty (3-0 level) on Qm in a flat Universe. In 
both cases (absorption or emission), measurements are limited by the small-scale 
motions in the medium - turbulence, typically — that broadens all lines and de- 
creases resonant lines optical thickness. Quantitative Monte-Carlo simulations help 
in separating turbulence from resonant scattering [e.g. 225, 266]. In the perspec- 
tive of using such a method to measure Ho, an important recent observational result 
is the measurement of resonant scattering in the Fe xxv Hea line in the Hitomi 
5 eV-resolution spectra emitted by the Perseus cluster core [95]. 
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7 Cluster internal mass distributions 


The hierarchical growth of structure leaves an imprint on the galaxy cluster internal 
mass distribution. Following this argument, cosmological information must be en- 
coded in the mass profiles and X-ray cluster analyses play an important role in this 
respect [e.g. 34, 45, 221]. The popular Navarro-Frenk-White (NFW) model [172] 
for the total matter density distribution p (r) in halos has free parameters rs and ôo: 


p(r) E do (30) 


* (Je 


This expression may be recast in terms of the concentration c4 = rA/r; by introduc- 
ing the total mass MA (see Eq. 12) within a given overdensity radius rA, providing: 


3 
& = 4 z S 
3 In(14- c4) — cA/(1 4- cA) 

The relation between mass and concentration predicted by numerical simulations 
(and its evolution) is sensitive to the cosmological model. 

One expression of this relationship is the dependence of the concentration upon 
the background density of the Universe at the time the halo collapses. Using vari- 
ous subsamples drawn from a pool of 44 clusters with deep X-ray observations and 
assuming the NFW matter distribution for the corresponding halos, Ettori et al. [66] 
obtained a set of cosmological constraints. Such an analysis provides constraints on 
the combination Sg = ogQ!): from 26 clusters with robust measurements, it returns 
T ~ 0.60 and Sg = 0.45 + 0.01, and for a subset of 11 low-entropy core clusters it 
gives I' = 0.56 and Sg = 0.39+0.02. X-ray data plays a key role in this work, as it 
provides the mass and concentration from the assumption of hydrostatic equilibrium 
(Eq. 15). The likelihood analysis considers incorporating gas fraction measurements 
in order to break the strong degeneracy along the Qm dimension. The reported con- 
straints amount to og = 1.0 +0.2 and Qm = 0.26 + 0.02, the former constraint shift- 
ing to og = 0.83 43-0.1 when only relaxed objects are included. This analysis, albeit 
providing tight constraints independent from cluster abundance studies, highlights 
the sensitivity of the method to the model concentration-mass relation and more 
generally to the model of dark matter distributions as a function of cosmology. 

Nevertheless, the NFW parametric profile fails at providing a good fitting func- 
tional for halos not entirely relaxed and (or) dynamically disturbed. For those sys- 
tems, little information is gained from the inferred concentration-mass relation and 
thus on the underlying cosmological model. Analyses of N-body numerical sim- 
ulations shows a correlation between the fraction of profiles departing from the 
NFW model and the cosmological model; this led to introduce 'sparsity' as a 
new, parameter-free, cosmological probe of cluster mass distributions [11]. Spar- 
sity 5A, A, is defined as the ratio of masses computed at two different overdensities 
chosen to encompass the intermediate- to large-radial range: $4, A, — MA, /M, A, With 
A, < 42. Typically, A; ~ 200 and A» ~ 500, but other radii are sensible choices as 
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long as A; = 100 and A» < 2000. Simulations indicate small scatter in this value and 
importantly, small percent-level mass dependence [50]; even at z~ 1 [14]. Using this 
property and knowing the predicted mass function at two different overdensity radii 
and at a given redshift bridges sparsity and the model cosmology [11, 49]: 


max 
Ma, dn 


mgin dM, 


(s4, ,4)Ma dn 
dinMa, = 5214) f dlnMa, (32) 


(5A, a5) Mg dM, 


where the integration runs over the mass range of the cluster sample and brackets 
indicate the average across the sample. Various estimators of this average exist for a 
given sample; any inconsistency among estimates reveals possible systematic biases 
in the determination or in the extrapolation of mass profiles [50]. Application of the 
sparsity test to a sample of 104 clusters with masses estimated from X-ray data as- 
suming hydrostatic equilibrium leads to Qm = 0.42 +0.17 and og = 0.80 + 0.31 
[49]; degeneracy between parameter constraints makes Sg = os /Qq = 0.48 +0.11 
more relevant an output of this test. This result takes advantage of an increased 
sensitivity of the sparsity test at around z ~ 0.5, as shown by analyses of a large 
mock catalogue of halos demonstrating the ability of the method to recover in- 
put cosmological parameters. Various systematic biases under scrutiny are found to 
have mild impact on the results; most prominent are selection effects and a radial- 
dependent hydrostatic mass bias. Hydrodynamical simulations incorporating the ef- 
fect of baryons indeed indicate a few percent impact from the latter. Finally, the 
results are sensitive to the choice of a theoretical mass function and the previously 
quoted constraints are slightly offset by a change in this assumption. 


8 Pink elephants 


According to the scenario for structure formation depicted in Sect. 2, we do not ex- 
pect massive clusters to have formed at high redshifts. Because cluster detection sig- 
nals scale positively with mass, especially at X-ray wavelengths, an extreme-mass 
system should easily be discovered in survey data if it exists. Exceptional massive 
structures therefore offer a promising test of the concordance model ?. Similarly as 
for cluster count studies, a number of effects and systematics must be understood 
before concluding to falsification of the cosmological model; among them are the 
uncertainties in the theoretical mass function [e.g. 99], the mass calibration and the 
scatter in the mass-observable relation [e.g. 165, 88], especially its impact on cluster 
selection [e.g. 102]. 

The recent discovery of massive high-redshift systems in wide surveys trig- 
gered an increase of interest in this area of research. Among others we may cite 
XMMU J2235 at z = 1.393 [168], RDCS1252.9 at z = 1.24 [147] or SPT-CL J2106- 


1? Or variations thereof, in particular non-Gaussianities in the primordial fluctuation field, see also 
note 18. By condordance model, we mean the most commonly accepted cosmological model, such 
as that depicted in Sect. 2. 
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5844 at z= 1.132, the most massive z > 1 cluster found in the early SPT survey [79], 
with Mogg. ~ 10M... Abundant literature followed these discoveries, feeding a de- 
bate on their ‘rareness’ in the context of the standard model and its extensions. As 
of today, it seems none of these systems indicates clear tension with ACDM. For 
the purpose of this Chapter, we follow Harrison & Hotchkiss [89] in classifying sta- 
tistical tests into two categories: tests relying on extreme value statistics and those 
based on the rareness of a given cluster of a certain mass and redshift in ACDM. A 
discussion of extreme pairwise velocities is appended, that comes as an interesting 
complement to these tests. 

This section entails studies of galaxy clusters in X-rays, although the search for 
extreme objects represents a wide field of investigations in cosmology, involving 
structures of many kinds: walls, voids, superclusters [e.g. 228, 183, 212]. 


8.1 Extreme-value statistics 


We may examine the statistics of extrema in a random field smoothed with a kernel 
of a given scale within a volume of the Universe (a ‘patch’) of a given size [44]. 
The probability distribution of the highest peak equals the probability Po(v) that no 
local maximum with normalized density Vmax exceeds a certain threshold value v 
[e.g. 19]: 


v 
Prob. (Vmax < v) = / DG(Vmax)d Vmax = Po(v) (33) 


where pG(Vmax) denotes the distribution of local maxima among patches; this is 
the statistics of interest here, easily obtained with pg = dP) /dv. The ‘count-in-cell’ 
formalism allows further progress. One has [e.g. 258]: 


Po(v) = exp[—n(> v)Vo(N.)] (34) 


with N: = n(> v)vėł, The expression for Ne comprises the average number of 
peaks above threshold, n(> v), computed over the volume of interest V; clearly 
Ne represents the typical number of halos exceeding v in an overdense patch. In 
general o is a function involving all the halo correlation functions Ek of order N, 
averaged over the volume V. A simple formula is obtained in the limit of massive 
peaks (v >> 1) and large separations [e.g. 18]: 


sie (1+5) e^9, with 8e? — y. (35) 


Releasing some of these assumptions? allows to enlarge the range of applicability 
of this statistics to smaller patches, smaller masses and smaller separations between 
halos. An interesting limiting case is: 


20 Tt is beyond the scope of this Chapter to review the assumptions leading to this formula, the 
interested reader may refer to Colombi et al. [44]. 
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o(N-)~1—Ne/2 (if Ne « 1) (36) 


For N, = 0, the set of equations presented above corresponds to the so-called ‘Pois- 
sonian’ form. An evaluation of its validity is carried in Davis et al. [55], both analyt- 
ically and with the help of a numerical experiment using an N-body simulation. The 
analytic comparison with the complete formula involving halo clustering (Eqs. 34 
and 35 and invoking a linear halo bias b) shows good agreement for patches of size 
L ~ 10057! Mpc. The comparison with N-body simulated patches demonstrates 
good performance of the full formula for patches of sizes above ~ 20^! Mpc. In 
all cases the agreement is better at the high-mass end of the distributions (large v 
and large separations). 

The expression in the ‘Poissonian’ limit is exploited by Waizmann et al. [254] 
with the specific goal of relating the shape of pg(m) to the following parametric 
form: 

Pagv(u) — exp [-(1-- y) "|, z= (u—a)/b (37) 


with free parameters a, b, y. This distribution is sometimes called ‘General Extreme 
Value statistics’ (GEV). With an assumption for the expression of the halo mass 
function n( m) [e.g. 229, 238] and by performing first-order Taylor expansion 
of both pg and parv around their maximal values [see e.g. 55, for a derivation], 
one obtains analytic expressions for the parameters (a,b, y), with a dependence on 
cosmology and on the growth of structure. In particular, the study shows that the 
most-likely extreme halo mass in a patch of sky, directly related to a, is a sensitive 
probe of the ACDM model. To demonstrate this result, Waizmann et al. [254] re- 
cover a from a fit of the full cumulative distribution of extreme masses extracted 
from ~ 500 independent Universe patches at 1 < z « 1.5 with a limiting sensitivity 
M ~ 10142571 M... The proposed cosmological test therefore consists in measuring 
the mass of the most extreme cluster in each patch with a certain precision, then 
group clusters in mass bins to form a cumulative distribution and finally fit the GEV 
formula (Eq. 37) to retrieve the best-fit a and use its value to discriminate between 
various dark energy models or gravitation theories. 

In an investigation on constraints on non-Gaussianities, Chongchitnan & Silk 
[39] provide an independent and detailed derivation of the extreme value statistics 
for halos above a given mass threshold (Eq. 34, 35); of particular interest is their 
analytic development carried for weak non-Gaussianities of the underlying field, al- 
lowing to expand o one order beyond the approximation of Eq. 36. The probability 
that XMMUJ0044.2 [214] is the most massive system in the XDCP survey is consis- 
tent with fyn = 0; the usual degeneracy between og and fyi requires fixing its value 
to that of WMAP-7. The assumptions made on the shape of the mass function play 
an important role in this analysis, all the more at higher redshift (XMMUJ0044.2 
has redshift z — 1.579). This work does not involve fitting or asymptotic formula 
such as Eq. 37. However the convergence of the distribution towards this family of 
functions is found to be insensitive to the value of fw; . 
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8.2 Rareness of events 


One may ask the question of whether a particular system is the most massive in a 
given survey. For instance, Cayón et al. [38] examine the case of XMMU2235 at 
z ~ 1.4. Poisson statistics serve in deriving the posterior probability for fy, and os, 
using a mass function modified to account for local non-Gaussianities. The study 
examines various systematics including Eddington bias, assumptions on the con- 
centration parameter involved in the mass determination and triaxiality as well as 
assumptions on the theoretical mass function. This work finds a tentative positive 
value for fy (see Sect. 4 for a definition of this parameter), although zero is not 
excluded. 

As pointed out by several authors [e.g. 101, 89] some of the statistics quantifying 
rareness of a single cluster found at a given mass and redshift suffer from a bias due 
to their neglecting the full population of extreme clusters of different masses and 
redshifts. Low probability values delivered by a set of “exclusion curves’ [as pro- 
posed in e.g. 165] do not necessarily point to exceptional systems after accounting 
is made for the underlying distribution of halos in the mass-redshift space. How- 
ever such tabulated probabilities proved useful in estimating the probability that no 
cluster is more massive and more distant than a given object [e.g. 107]. In this re- 
gard, rareness of a sample of clusters does not imply falsification of the ACDM 
model: an instructive critical view may be acquired with the help of Monte-Carlo 
numerical simulations [102]. For instance, Kim et al. [115] provide a new estimate 
of the mass of SPTJ2106; using one of the rareness statistics of Hotchkiss [101] 
their study assesses a discovery probability of ~ 72% within the survey volume. 
This value is higher than the initial estimate [~ 7%, 79] thanks to their updates of 
the mass measurement and of the statistical treatment. An additional rareness study 
of XMMUJ0044.2 is described by Tozzi et al. [240] following deep Chandra X-ray 
observations and a new estimation of its (hydrostatic, isothermal) mass; the authors 
consider its existence as an additional hint tentatively indicating tension in the stan- 
dard ACDM model or a significantly higher value for og than found by WMAP-7. 


6.3 Extreme pairwise velocities 


A complementary test involves pairwise velocities in major cluster mergers, whose 
extreme amplitudes are predicted as exceptional in the concordance model. Current 
X-ray observatories can provide information on the shock velocity for collisions 
near the plane of sky [e.g. 157, 54]. Observational searches at CCD spectral resolu- 
tion with Chandra hints towards a tentative detection of bulk motions perpendicular 
to the plane of sky in the gas of the Bullet Cluster [146], however robust velocity 
measurements require high-resolution X-ray spectroscopy [as in 94], a technique 
soon to be routinely available with modern X-ray observatories [171, 262]. The Bul- 
let Cluster (1E0657-56) is one cluster concentrating most of attention given its well 
measured properties, in particular the shock velocity (vs =~ 4700 km s^ !). However 
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the shock speed might be higher than the subcluster speed, the latter amounting to 
~ 3000 km s^! [e.g. 233], hence lowering the exceptional character of this system. 
Aided with a large N-body simulation Watson et al. [257] found no inconsistency 
with the ACDM model, although this study seems to be missing a complete account 
for systematics. The simulation-based work in Bouillot et al. [30] also concludes 
to an absence of inconsistency, complementing findings from independent studies 
[e.g. 125, 133]. The debate is still open though, as many more extreme systems such 
as the Bullet cluster would seriously challenge the standard model. The ‘El Gordo’ 
system, a galaxy cluster merger at z = 0.87, might be one such object that is rais- 
ing an inconsistency given its mass M299 = (2 — 3) x 10! Mo and infall velocity 
Vinf ~ 2500km s~! [8]. 


9 Clusters as gravitational theory probes 


The scenario for structure formation in the ACDM model described in Sect. 2 puts 
gravity at the centre of the play. The distribution, appearance, properties of galaxy 
clusters therefore strongly depend on the underlying law of gravity; detailed stud- 
ies offer tests of the gravitation theory at scales between cosmological and galactic. 
Modifications to the General Relativity have an impact on the abundance of clusters, 
on their shape, on the gravitational potential they live in and on virtually any obser- 
vational feature we have discussed in this Chapter. We refer the reader to recent 
reviews [e.g. 127, 57, 123, 35, 72] for a comprehensive overview and results from 
tests of gravity including galaxy clusters. In this Chapter we briefly describe three 
example studies specifically relying on X-ray cluster observations and samples. 

As mentioned in previous sections, experiments probing the abundance of X-ray 
galaxy clusters may constrain the exponent y parametrizing the linear growth-rate 
f(z) = g(z) = Qmn(z)", thereby offering a (so far successful) consistency check of 
General Relativity [e.g. 201, 202, 203]. Constraining alternative theories of gravita- 
tion necessitates a full understanding of the non-linear growth of structures and halo 
collapse in these frameworks. Analytic calculations provide insight into the scales 
and mass regimes most affected by such modifications [e.g. 31]. N-body simula- 
tions complement these works and provide fits to the modified halo mass function, 
for instance in Hu-Sawicki f(R) gravity and Dvali-Gabadadze-Porrati models [e.g. 
178, 71, 87]. The reanalysis of the abundance of X-ray cluster samples puts con- 
straints on the absolute value of the perturbation parameter fro entering one spe- 
cific model of /(R) modified gravity, to less than 107^? (at 95% confidence level) 
[220, 36]. A value of zero for fro indicates consistency with ACDM, while devi- 
ations from this model are of the order of this perturbation parameter. Systematics 
for this class of tests of modified gravity are related to uncertainties on the halo 
mass function, cluster selection effects and the mass-observable link, similarly as 
discussed in Sect. 3. 

A distinct use of clusters for testing gravity relies on their matter distribution. Fol- 
lowing the proposal of Verlinde [247] for a theory of “Emergent Gravity’, a number 
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of analyses concentrated on comparisons of the observational X-ray properties of 
galaxy clusters to the radial distribution of the apparent dark matter. The latter arises 
in the theory in a form that depends only on the baryonic mass (predominantly as 
X-ray emitting gas) and Ho. Whether estimating total masses from hydrostatic equi- 
librium equations [67] or weak-lensing measurements [236, 268] of several tens of 
systems, no preference is found for such an alternative gravitation theory, at least in 
its current form. 

The gravitational potential may be probed through the gravitational redshift ex- 
perienced by photons losing energy as they escape deep potential wells. The shape 
of the potential is changed as gravity is modified. Stacking together the spectroscop- 
ically measured galaxy velocities in a large sample of optical clusters, Wojtak et al. 
[261] identified a shift of velocity for satellite galaxies relative to centrals, indicative 
of the sought-after gravitational redshift signal. Further analyses incorporating all 
relevant relativistic effects and consistent changes in the halo mass function however 
conclude to little discriminating power of this technique if only dynamical mass in- 
formation for a narrow range of cluster masses is available [e.g. 265]. A study using 
X-ray-based masses obtained for ~ 2500 galaxy clusters detected in the SPIDERS 
survey also concludes to no preference between General Relativity and a special 
case of f (R) gravity [167]. In all these studies, the amount of the measured veloc- 
ity shift is of order 10 — 20 km s^!. The future use of high-resolution spectroscopic 
X-ray data may probe gravitational redshifts of the gas instead of individual galax- 
ies [e.g. 32, 213]. This observational technique provides a reduction of the noise 
inherent to measurements due to the high number of gas parcels in comparison to 
galaxies, at the expense of finely controlling all effects creating ~ few 10kms~! 
line shifts in the spectral emission (such as turbulent and bulk motions). Only with 
upcoming instrumentation will this observational technique be accessible, together 
with the observational and theoretical improvements in our understanding of the 
intra-cluster gas dynamics and chemical enrichment. 


10 Selection function 


When the catalogs of galaxy clusters are derived using X-ray data, X-ray properties 
of clusters and methods of cluster detection form the selection function of the survey. 
The shape of the X-ray emission of the cluster in combination with the detection 
method affects the selection: unresolved X-ray emission (typically covering 2 — 
6' radius) formed the basis for the bulk of cluster catalogs based on ROSAT All- 
Sky survey data, which for distant clusters covers most of the cluster, while for the 
nearby (z « 0.1) clusters it covers only the core, where X-ray emission scales poorly 
with cluster mass. While accounting for the total flux of the cluster is performed, 
such selection leads to unbalanced representation of merging and relaxed clusters. 
Another drawback of this approach is that 90% of such sources are not clusters and 
one has to clean for contamination in the sample and in cluster X-ray properties. 
A huge improvement in the purity of cluster selection came with using the extent 
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of X-ray emission. The selection function in those surveys was found to depend 
on the cluster core radii. In order to assess the limitations of the complexity of 
the cluster shapes, cloning of observed clusters has been used to test the cluster 
detection algorithms. 

Scatter in the cluster shape has been found to correlate with scatter in cluster X- 
ray luminosity, which affects cluster selection [112]. The full account for covariant 
scatter on cluster selection is given in Finoguenov et al. [76]. In terms of the selec- 
tion, the best area of the cluster to use seems to be between 1/4 and 1/2 of R509., and 
efforts have been made to create the detection procedures for that [64]. Avoiding 
cluster cores in the detection has been widely used in deep X-ray surveys [77, 78]. 
The advantage one uses in those methods is the large information on clusters shapes 
available at X-rays. 

In the forward modelling, for each prediction of mass function, one predicts the 
full set of cluster X-ray mass proxies and mimics the selection process. Detection 
of galaxy clusters forms the selection function, while the procedure of measurement 
of the cluster properties forms the cluster distribution function. On the procedure 
of measurements of cluster properties, there is a large variety of approaches taken. 
Even to obtain the cluster Ly one has to determine the cluster extent. However, 
in most cases only the central part of the cluster is detected and extrapolation of 
cluster properties is required. For that, one either uses a beta profile, which is either 
measured [248] or is taken from scaling cluster libraries [76], or sets the definition of 
cluster Ly to be within the zone of likely detection (e.g. 300 kpc for XXL). REFLEX 
and NORAS surveys [25] do not have a scalable definition of Ly and corrections 
have been computed for those clusters [150, 189]. 

The purpose of deriving a selection function is to emulate the behaviour of a 
detection chain that led to the construction of a sample, while varying assumptions 
on the physics and cosmological model. Expensive end-to-end simulations are an 
option which accounts for many of the details [projection effects, point-source con- 
tamination, etc., see e.g. 46], while scaling models are an alternative that bypasses 
the expensive computations, but need to formulate strong assumptions on their be- 
haviour. CODEX, XXL selection functions are somehow half-way between these 
two extremes. In order to make sure that certain cluster shapes are not missed in 
the current X-ray libraries, studies of clusters selected using different ways (e.g. SZ 
effect, galaxy concentrations or lensing) are used. The most obvious result of those 
searches so far is finding that clusters without the cool core are more abundant, 
compared to the results of REFLEX/NORAS catalogs. Similar results are obtained 
using spatially resolved X-ray searches for clusters. 

In most cases, identification of galaxy clusters requires detection of cluster galax- 
ies. Limitations of these data will affect the selection of clusters and of importance 
is the covariance of optical and X-ray properties of the sample. The current results 
[70] also suggest the anti-covariance, which implies that cluster entering the sam- 
ple by upscatter in X-ray properties, might leave the sample due to downscatter 
in optical richness. The scatter of properties in clusters is linked to the physics of 
clusters. X-ray luminous clusters have bright cores and are preferentially relaxed. 
Merger clusters tend to have high richness [170]. The study of Klein et al. [118] has 
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introduced a concept of using cluster identification to study the chance association 
between the X-ray source and the optical counterpart. Using an overlap with Klein 
et al. sample, Finoguenov et al. [76] concluded that chance identification starts to be 
important below the 10% completeness boundary of the X-ray survey. 


11 Conclusions and forward look 


X-ray clusters characterize well the massive end of halo mass function, and there 
are no limitation imposed on future studies to extend the X-ray cluster searches to 
redshifts beyond the current record of 2.5. The future studies will benefit from im- 
proved sensitivity in the soft X-ray band (0.1-2 keV) (e.g. of Athena) and spatial 
resolution better than a few arcseconds, as anticipated from the Lynx mission. Large 
catalogs of clusters delivered by eROSITA all sky will serve a great input for cos- 
mological studies. The best behaved part of the X-ray emission of clusters is located 
between 0.2 and R5ooc, where X-ray (which is necessarily a surface brightness) se- 
lection have best chances to be complete. 

Currently, z « 0.5 X-ray cluster studies have reached the limit of systematics. 
This requires both mass calibration and improved modelling of all baryonic cluster 
components. A variety of shapes of X-ray emission of clusters might be associated 
with a difference in the cluster baryonic content induced by AGN feedback. In or- 
der to understand that, detailed X-ray studies are required to recover the diversity 
of gas mass fraction close to the virial radius. Historically, these searches have been 
hampered by problems with robust definitions of massive halos using optical cluster 
catalogs and deficiencies in existing X-ray catalogs at low-z. A number of Large 
Programs on XMM-Newton are currently underway to progress on this issue. Deep 
X-ray and spectroscopic surveys have also been proven to be the best way to address 
this [73, 48]. A special topic is whether the distribution of X-ray properties of clus- 
ters follows the power law dependence on mass with a log-normal scatter, or has a 
more complicated functional form. With increasing size of X-ray cluster samples, 
and increasing variety of cluster detection methods, one expects to collect more data 
on this subject. 

Variety of baryonic properties of halos also affects weak lensing cosmology and 
present a new research focus on X-ray groups and clusters. For instance the high pre- 
cision of Euclid optical and near-infrared measurements will be sensitive to detailed 
processes associated with galaxy formation, including the growth of black holes 
and their feedback (co-evolution). Semboloni et al. [224] predict the importance of 
the contribution of galaxy groups to the shear power spectrum down to 10^ M... 
Debackere et al. [58] confirm this finding through modelling the distribution of 
baryons in groups following observations of stellar and hot intragroup medium. 
Baryonic feedback processes, either from supernovae (SNe) or active galactic nuclei 
(AGN) redistribute the gas, which changes the matter power spectrum, in particular 
on scales where the sensitivity of cosmic shear is maximal. Semboloni et al. [224] 
showed that ignoring this effect leads to large biases, whereas marginalising over 
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current model uncertainties weaken the constraining power of weak lensing shear 
experiments. The way forward is to constrain feedback directly and update its im- 
plementation in models [e.g. 161]. Groups are the best targets for resolving this issue 
because on their scale: on one hand the dominant feedback mechanism must change 
from stellar processes, such as galactic winds to AGN-driven processes and imprint 
telltale signatures on the warm gas, as the binding energy and the AGN output are 
similar; on the other hand, all the components of galaxy groups can be robustly de- 
termined and groups of galaxies are the lowest mass structures that impact the shear 
signal on the scales of interest for Euclid cosmology. 


12 Resources 


Any initial analysis of X-ray observations is performed with a mission-dependent 
software package, with a common goal of producing calibrated event list and gen- 
erating auxiliary responses. For a collection of the software on various mission we 
would like to point the reader to HEASARC?!. Most of the data products has been 
stored using FITS format with the FTOOLS software, available on the same site, 
as well as a package for spectral analysis, XSPEC. Detailed study of line emission 
shall benefit the largest database of lines to date, available through another spectral 
fitting code, SPEX?". 

Cosmological codes relevant to galaxy clusters are available through Astropy^ 
[9] and COLOSSUS? [62]. Cosmology codes in FORTRAN and other languages 
are available from E. Komatsu’s website~>. A cosmic variance python module [130] 
is available online?6. 
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